BEST AVAILABLE COPY 

t 

_J j _ 

(1 2) PATENT ABRIDGMENT (1 1 ) Document no AU-B-21 1 72/88 
(1 9) AUSTRALIAN PATENT OFFICE (1 0) Accede no 61 3944 



(54) Title 

RECOMBINANT HTLV-iH PROTEINS AND USES THEREOF 

international Patent Ciassification(s) 
(51 }* C07K 013/00 

(51)* A61K 037/02 C12N 015/00 C12N 015/49 

C12P 021/00 

(21) Application No. : 21172/&8 (22) Application Date ■ 18.08.88 

(30) Priority Data 

(31) Number (32) Date {33) Country 

107231 09.10.87 US UNITED STATES OF AMERICA 

(43) Publication Date : 13.04.89 

(44) Publication Date of Accepted Application : 1 5.08.91 

(71) Appiicant(s) 
REPUCEN CORPORATION 

(72) Inventcr(s) 

SCOTT D. PUTNEY; DEBRA LYNN; KASHAYAR JAVAHERJAN; WILLIAM T. MUELLER; JOHN 
FARLEY 

(74) Anomey or Agent 

SPRUSON A FERGUSON , GPO Box 3898, SYDNEY NSW 2001 

{ 56) Prior Art Documents 
AU 76358/87 
AU 71032/87 
AU 56363/86 

(57> Claim 



1- A process for stimulating a lymphocyte proliferative response in 
humans which comprises treating humans in need of stimulation of a 
lymphocyte proliferative response with a recombinant HIV portion of an 
HTLV-III protein selected from the group consisting of RIO, PB1 , 590 and 
KH1, wherein members of said group are as hereinbefore described. 



F 0 RM 1 SPRUSON & FERGUSON 

1 




COMMONWEALTH ^AJSTRWDA 
PATENTS ACT 1952 

APPLICATION FOR A STANDARD PATENT 



Repligen Corporation, of One Kendall Square, Cambridge, Massachusetts, 02139, 
UNITED STATES OF AMERICA, hereby apply for the grant of a standard patent for 
an invention entitled: 

Recombinant HTLV-III Proteins and Uses Thereof 

which is described in the accompanying complete specification. 

Details of basic appl ication(s) :- 

Basic Applic. Ho: Country: Application Date: 

107231 US 9 October 1987 

The address for service is:- 

Spruson & Ferguson 
Patent Attorneys 
Level 33 St Martins Tower 
31 Market Street 

Sydney New South Males Australia 

DATED this EIGHTEENTH day of AUGUST 1933 
Repligen Corporation 

By: 



MM 

Registered Patent Attorney 



TO: THE COMMISSIONER OF PATENTS 
OUR REF: 56873 
S&F CODE: 61175 



5845/4 



spnison & Ferguson 



Titte of Invention 



Full nime(s) and 
atfi5re»(es) of 
Deciaxant(s) 



Full name(j> of 
Applicants) 



COMMONWEALTH OF AUSTRALIA 

THE PATENTS ACT 19S2 

DECLARATION IN SUPPORT OF A 
CONVENTION APPLICATION FOR A PATENT 

In support ot the Convention Application made for a 
patent for an invention entitled: 

Recombinant KTLV-III Proteins and Uses Thereof 



Thomas H. Fraser 



Repligen Corporation 
of One Kendall Square 

Building 700 

Cambridge, Massachusetts 02139 USA 
do solemnly and sincerely declare as follows :- 

1 . barR/^x^*ks*£^^ 

(or, in the case of an application by a body corporate) 
\, 1 am/We-8fe authorised by . Repligen Corporation 



AUSTRALIA 

17 ANL,A*D 

DECLARAT ton 

',fP< 



9 9 » 

• ©• 



BHsic Ountry(les) 
Priorit/Daiefs) 

CO 0 
* • 9 

Basic Applicant(s) 



ttexippltcastt^aKrto 
i fc^Uxoax itckaUicx 

The basic application^} as defined by Section 141 of the 
Ac i was/wefe made 

in The United States of America U.S. Ser. No. 107,231 
on October 9, 1987 

by Scott D. Futney, Debra Lynn, Kashayar Javaherian, William T. 
Mueller, and John Farley 



Fult>8ame(s) and 
•jy«resH«s) of 
invehtor(s) 



• 

» o • 



* o » 
» © • 

9 3 J 



Set out how Applicant^ 
derive title frojn actual 
inventor(s) e.g. The 
Applicant(s) is/are Che 
assi3nee(s) of ihe 
invention from the 
inventor fs) 



3. raHWttS^^ 

(or where a person other than the inventor is the applicant) 

3. Scott D. Putney, Debra Lynn, Kashayar Javaherian, William T. Mueller, 
and John Farley 

of 5 Epping St., Arlington, MA 02174 USA; 11 Allen St., Apt. 11, ArUngto: 
MA 0*2174 USA; 27 Webster Rd., Lexington, MA 02173 USA; 26 Copeland Sr.. 
Watertovm, MA 02172 USA; and 261 Culver Rd. , #9, Rochester, NY 14607 III 

(respectively) 

is/are the actual inventor(s) of the invention and the facts upon 
which the applicants i5/"«*e-entitied to make the application are 
as follows: 

The Applicant is the assignee of the invention from the inventors. 



SFP4 



4. The basic application^ referred to in paragraph 2 of this 

Declaration was/we*? the first application^) made in a Convention 
country in respect of the invention^ the subject of the application. 

Declared at Cambridge, MA thisX SPi Q day of /rf^s^Jf 19 J/ 



io: The Commissioner of Patents 



Signature of Declarant(s) i \ i\ 

Thosias H. Fraser, Executive Vice President 



Repligen Corporation 



S & F Ref : 56873 



FORM 10 



COMMONWEALTH OF AUSTRALIA 
PATENTS ACT 1952 ££ 
COMPLETE SPECIFICATION^ 




(ORIGINAL) 



FOR OFFICE USE: 



Class 



Int Class 



Complete Specification Lodged: 
Accepted: 
Published: 

Priority: 
Related Art: 



Name and Address 

of Applicant: Repligen Corporation 



Address for Service: Spruson & Ferguson, Patent Attorneys 

Level 33 St Martins Tower, 31 Market Street 
Sydney, New South Wales, 2000, Australia 



Complete Specification for the invention entitled: 



The following statement is a full description of this invention, Including the 
best method of performing it known to me /us 



One Kendal 1 Square 
Cambridge Massachusetts 02139 
UNITED STATES OF AMERICA 



Recombinant HTLV-III Proteins and Uses Thereof 



5845/3 



ABSTRACT 

A process for stimulating a lymphocyte proHferative response in 
humans which comprises treating humans In need of stimulation of a 
lymphocyte proliferative response with a recombinant HIV portion of an 
5 HTLV-III protein selected from the group consisting of RIO, PB1 , 590 and 
KH1 , wherein members of said group are as hereinbefore described. 
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DESCRIPTION 

RECOMBINANT HTLV-III PROTEINS 
AND USES THEREOF 



Background of the Invention 

Human T-cell lymphotropic virus (HTLV-III) , 
lymphadenopathy-associated virus (LAV), or AIDS- 
associated retrovirus ( AR V } has been identified as 
the cause of acquired immune deficiency syndrome (AIDS) 
(Popovic, M. , Sarngadharan , M.G., Read, E . and Gallo, 
R.C. , [1984] Science 224:497-500). The virus displays 
tropism for the OKT4 ' lymphocyte subset (Klatzmann , 
D. , Barre-Sinoussi , F . , Nugeyre , M.T., Dauguet , C, 
Vilmer, E. , Griscelli , C. t Brun-Vezinet , F. , Rouzioux , 
C . , Glue km an , J.C., Chermann, J.C- and Montagnier , L. 
[1984] Science 225 : 59-63) . Antibodies against HTLV-III 
proteins in. the sera of most AIDS and AIDS related 
complex (ARC) patients , and in asymptomatic people 
infected with, Che virus (Sarngadharan, H.G., Popovic, 
M. , Bruch, L. , Schupbach , J. and Gallo, R.C. [1984] 
Science 224:506-508) have made possible the developmen 
of immunologically based tests that detect antibodies 
to these antigens. These tests are used to limit the 
spread of HTLV-III through blood transfusion by. identi 
fying blood samples of people infected with the virus. 
Diagnostic tests currently available commercially use 
the proteins of inactivated virus as antigens . 
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In addition to allowing new approaches for diag- 
nosis, recombinant DNA holds great promise for the 
development of vaccines against both bacteria and 
viruses (Wilson, T. [1984] Bio /Techno logy 2-29-39). 
The most widely employed organisms to express recom- 
binant vaccines have been E. coli , S. cerevisiae 
and cultured mammalian cells . For example , subunit 
vaccines against foot and mouth disease (Kleid, D.G- , 
Yansura, D- , Small, B. , Dovbenkc , D. , Moore, D.M, , 
Brubman, M.J., McKercher , P.D., Morgan D.O . , Robertson, 
B.H. and Bachrach, H.L. (1981] Science 214:1125-1129) 
and malaria (Young, J.F., Hockmeyer, W.T., Gross, M. ? 
Ripley Ballou, W. , Wirtz, R. A. , Trosper , J.H- , Beaudoin, 
R.L., Hollingdale, M.R. , Miller, L.M. , Diggs, C,L. 
and Rosenberg, M. [1985] Science 228:958-962) have 
been synthesized in E. coli . Other examples are 
hepatitis B surface antigen produced in yeast (McAleer , 
W.J.. t Buynak, E.B., Maigetter, R.Z., Wampler, D.E., 
Miller, W.J, and Hilleman, M.R. [1984] Nature 307: 
178-180) and a herpes vaccine produced in mammalian 
cells (Berman, P.W. , Gregory, T. , Chase, D. andLasky, 
L.A. [1984] Science 227:1490-1492). 

There is a real need at this time to develop a 
vaccine for AIDS- No such vaccine is known to exist. 

Brief Summary of the Invention 

The subject invention concerns novel recombinant 
HTLV-III proteins and the uses thereof. More specifi- 
cally, the subject invention concerns novel 
recombinant HTLV-TII envelope proteins which can be 
used in the diagnosis, prophylaxis or therapy of AIDS. 
Further, the recombinant HTLV-III envelope protein 
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fragments of the invention can be used to stimulate 
a lymphocyte proliferative response in HTLV-III in- 
fected humans. This then would stimulate the immune 
system to respond to HTLV-III in such individuals and, 
therefore, the envelope protein fragments can provide 
protection and be of therapeutic value- These novel 
proteins are encoded on bacterial plasmids which can be 
used to transform suitable hosts, for example, E. coli , 
using standard procedures. 

Reference to the Drawings 

FIGURE 1 — This is a flow chart of the construction 

of plasmid pREV2. 2 which is used to construct 

vectors encoding novel proteins . 
FIGURE 2--This is a diagram of plasmid pREV2.2 showing 

the multiple cloning site. 
FIGURE 3- -This is a schematic of the KTLV-III envelope 
♦ gene and the novel recombinant proteins 

obtained therefrom. 
FIGURE 4- -Drawing showing the removal of N- terminal 

non-HTLV-III sequences of P31. 
FIGURE 5 — Drawing showing the removal of C- terminal 
. non-HTLV-III sequences from FBI. 

Detailed Disclosure of the Invention 

Expression vector plasmid pREV2.2 was constructed 
from plasmid pBGl. The flow chart showing the construc- 
tion of this plasmid is given in Figure 1 of the drawings- 

Plasmid pRIO contains approximately 1275 base pairs 
of DNA encoding the HTLV-III env gene from essentially 
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the Kpnl site to the Bglll site. This plasmid in a 
suitable bacterial host, e.g., E. coli , can be used 
to produce the novel recombinant' HTLV-III 95 kD fusion 
protein denoted RIO. The amino acid sequence of 
fusion protein RIO is shown in Table 8; the DNA 
sequence encoding this protein is shown in- Table 8A. 
The amino acid sequence of the HIV portion of protein RIO 
is shown in Table 12. The DNA sequence encoding the HIV 
portion of protein RIO is shown in Table 12A. 

Plasmid pPBl contains approximately 540 base pairs 
of DNA encoding essentially the HTLV-III env gene from 
the PvuII site to the Bgl ll site. This plasmid in a 
suitable host, e.g., E. coli , can be used to produce 
the novel recombinant HTLV-III 26 kD fusion protein 
denoted PB1. The amino acid sequence of fusion 
protein PBl is shown in Table 9; the DNA sequence 
encoding this protein is shown in Table 9A. The amino 
acid sequence of the HIV portion of protein PBl is shown 
in Table 13. The DNA sequence encoding the HIV portion 
of protein PBl is shown in Table 13A. 

Plasmid p590 contains approximately 1055 base pairs 
of DNA encoding essentially the HTLV-III env gene 
from the PvuII site to the Hin di II site. This plasmid 
in a suitable host, e.g., E. coli , can be used to pro- 
duce the novel recombinant HTLV-III 86 kD protein 
denoted 590. The amino acid sequence of fusion 
protein 590 is shown in Table 10; the DNA sequence 
encoding this protein is shown in Table 10A. The amino 
acid sequence of the HIV portion of protein 590 is shown 
in Table 14. The DNA sequence encoding the HIV portion 
of protein 590 is shown in Table 14A. 
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Plasmid pKHl contains approximately 1830 base 

pairs of DNA encoding essentially the HTLV-III env 

gene from the Kpn l site to the Hin dlll site. This 

plasmid in a suitable host, e.g., E. coli , can be used 

. to produce the novel recombinant HTLV-III 70 kD protein 

denoted KH1. The amino acid sequence of fusion protein 

KH1 is shown in Table 11; the DNA sequence encoding 

this protein is shown in Table 11A. The amino acid 

sequence o£ the HIV portion of protein KRT is shown in 

Table 15. The DNA sequence encoding the HIV portion of 

protein KH1 is shown in Table ISA. 

Plasmid pBGl is deposited in the E. coli host 

MS371 with the Northern Regional Research Laboratory 

(NRRL) , U.S. Department of Agriculture, Peoria, Illi- 
« 

nois, USA. It is in the permanent collection of this 
repository. E. coli MS371 (pBGl)., jg'RRL B-15904, was 
deposited on Nov. 1, 1984. E. coli MS371, NRRL B- 
^15129 Us now available to the public. E. coli" SG2Q251, 
NRRL 5^15918 J Vas deposited on Dec. 12, 1984. NRRL 
B-15904 and NRRL B-15918 will be available to the public 
upon the grant of a patent which discloses them. Other 
cultures which were deposited with NRRL and their 
deposit dates and numbers are as follows: 
Culture Repository No. Date of Deposit 

E. coli JM103(pREV2.2) NRRL B-18091 July 30, 1986 
E. coli SG20251(pR100 NRRL B-18093 July 30, 1986 
E. coli SG20251(pPBl) NRRL B-18092 July 30, 1986 
E. coli SG20251(p590) NRRL B-18094 July 30, 1986 
E- coli CAG629(pKHl) NRRL B-18095 July 30, .1986 
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The above deposits will be maintained in the NRHL 
repository for at least 30 years and v;ill be made 
available to the public upon the grant of a patent 
disclosing them. The deposits are also available 
as required by foreign patent laws in countries wherein 
counterparts of the subject application, or its progeny, 
are filed. However, it should be understood " that the 
availability of a deposit does not constitute a license 
to practice the subject invention in derogation of 
patent rights granted by governmental action . 

The novel HTLV-iil proteins of the subject 
invention can be expressed in Saccharomyces cerevisiae 
using plasmids containing the inducible galactose 
promoter from this organism (Broach, J.R. , Li, Y. , 
Wu, L.C. and Jayaram, M. in Experimental Manipulation 
of Gene Expression [1983] p. 83, ed. M. Inouye , 
Academic Press) . Thsse plasmids are called YEp51 and 
YEp52 (Broach, J.R. et al. [1983]) and contain the 
E- coli origin of replication, the gene for 8-lactamase , 
the yeast LEU2 gene, the 2 urn origin of replication and 
the 2 pin circle REP 3 locus. Recombinant gene expression 
is driven by the yeast GAL10 gene promoter. 

Yeast promoters such as galactose and alcohol 
dehydrogenase (Bennetzen, J.L. and Hall, B.D. [1982] 
J. Biol. Chero. 257:3018; Ammerer, G. in Methods in 
Enzymology [1983] Vol. 101, p. 192), phosphoglycerate 
kinase (Derynck, R. , Hitzeinan, R.A. , Gray, P.W. , 
Goeddel, D.V. , in Experimental Manipulation of Gene 
Expression [1983] p. 247 > ed. M. Inouye, Academic 
Press), triose phosphate isomerase (Alber, T. and 
Kawasaki, G. [1982] J. Molec. and Applied Genet. 
1:419), or enolase (Innes, M.A. et al. [19853 Science 
226:21) can be used. 
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The genes disclosed herein can be expressed 
in simian cells. When the genes encoding these proteins 
are cloned into one of the plasmids as described in 
Okayama and Berg (Okayama, H. and Berg, P. [1983] 
Molec. arid Cell. Biol. 3:280) and references therein, 
or COS cells transformed with these plasmids, synthesis 
of ETLV-III proteins can be detected immunologically. 

Other mammalian cell gene express ion /protein 
production systems can be used. Examples of other such 
systems are the vaccinia virus expression system 
(Moss, B. [1985} Immunology Today 6 ; 243; Chakrabarti, S- 
Brechling, K. , Moss, B. [19853 Molec. and Cell. Biol. 
5:3403) and che vectors derived from murine retroviruses 
(Mulligan, R.C. in Experimental Manipulation of Gene 
Expression [1983] p. 155, ed. M. Inouye , Academic Press) 

The HTLV-III proteins of che subject invention 
can be chemically synthesized by solid phase peptide 
synthetic techinques such as BOC and FMOC (Merrifield, 
R.B. [1963] J. Amer. Cheta. Soc. 85:2149; Chang, C. 
and Meienhof er , J. [1978] Int. J. Peptide Protein Res. 

11:246). 

As is well known in the art , the amino acid 
sequence of a protein is determined by the nucleotide 
sequence of che DNA. Because of the redundancy of the 
genetic code, i.e., more than one coding nucleotide 
triplet (codon) can be used for most of the amino 
acids used to make proteins, different nucleotide 
sequences can code for a particular amino acid. Thus, 
the genetic code can be depicted as follows: 
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-C2 . 


Phenylalanine (Phe) 


TTK 


Histidine (His) 


CAK 


Leucine (Leu) 


XTY 


Glut amine (Gin) 


CAJ 


Isoleucine (lie) 


ATM 


Asparagine (Asn) 


AAK 


Methionine (Met) 


ATG 


Lysine (Lys) 


AAJ 


Valine (Val) 


GTL 


Aspartic acid (Asp) 


GAK 


Serine (Ser) 


QRS 


Glutamic acid (Glu)- 


CAJ 


Proline (Pro) 


CCL 


Cysteine (Cys) 


TGK 


Threonine (Thr) 


ACL 


Tryptophan (Trp) 


TGG 


Alanine (Ala) 


GCL 


Arginine (Arg) 


WGZ 


Tyrosine (Tyr) 


TAK 


Glycine (Gly) 


GGL 


Termination Signal 


TAJ 






Termination Signal 


TGA 
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Key: Each 3-letter deoxynucleotide triplet corresponds 
15 to a trinucleotide of tnRNA, having a 5 f -end on the 

left and a 3 '-end on the right. All DNA sequences 
given herein are those of the strand whose sequence 
corresponds to the raRNA sequence, with thymine substi- 
tuted for /uracil. The letters stand for the purine or 
20 pyrimidine bases forming the deoxynucleotide sequence. 

A = adenine 
G = guanine 
C - cy to sine 
T - thymine 
25 X - T or C if Y is A or G 

X = C if Y is C or T 

Y = A, G, C or T if X is C 

Y = A or G if X is T 

W = C or A if Z is A or G 
30 B * C if Z is C or T 

Z = A, G, C or T if W is C 
Z = A or G if W is A 

QR = TC if S is A, G, C or T ; alternatively QR = 
AG if S is T or C 



35 



i 

i 



-9- 



R149.C2 



J a A or G 

K 5 I or C 

L - A, I, C or G 

M = A, C or T 

The above shows that the novel amino acid sequences 
of the KTLV-III proteins of the subject invention can 
be prepared by nucleotide sequences other than those 
disclosed herein. Functionally equivalent nucleotide 
sequences encoding the novel amino acid- sequences of 
these HTLV-III proteins, or fragments thereof having 
HTLV-III antigenic or immunogenic or therapeutic 
activity, can be prepared by known synthetic procedures. 
Accordingly, the subject invention includes such 
functionally equivalent nucleotide sequences. 

Thus the scope of the subject invention includes 
not only the specific nucleotide sequences depicted 
herein, but also all equivalent nucleotide sequences 
coding for molecules with substantially the same 
HTLV-III antigenic or immunogenic or therapeutic 
activity. 

Further, the scope of the subject invention is 
intended to cover not only the specific amino acid 
sequences disclosed, but also similar sequences coding 
for proteins or protein fragments having comparable 
ability to induce the formation of and/or bind to 
specific HTLV-III antibodies. 

The term "equivalent" is being used in its ordinary 
patent usage here as denoting a nucleotide sequence which 
performs substantially as the nucleotide sequence iden- 
tified herein to produce molecules with substantially the 
same HTLV-III antigenic or immunogenic or therapeutic 
activity in essentially the same kind of hosts. Within 
this definition are sub fragments which have HTLV-III 
antigenic or immunogenic or therapeutic activity. 
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As disclosed above, it is well within Che skill 
of those in the genetic engineering art to use the 
nucleotide sequences encoding HTLV-III antigenic or 
immunogenic or therapeutic activity of the subject 
5 invention to produce HTLV-III proteins via microbial 

processes. Fusing the ' sequences into' an expression 
vector and transforming or transfecting into hosts, 
either eukaryotic (yeast or mammalian cells), or 
prokaryotic {bacterial cells), are standard procedures 
10 used in producing other well-known proteins, e.g., 

insulin, interferons, human growth hormone, IL-1 , XL- 2 , 
and the like. Similar procedures, or obvious modifica- 
tions thereof, can be employed to prepare HTLV-III 
proteins by microbial means or tissue-culture technology 
15 in accord with the subject invention. 

The nucleotide sequences disclosed herein can be 
prepared by a M gene machine' 1 by procedures well known 
in the art. This is possible because of the disclosure 
of the nucleotide sequence. 
• 20 The restriction enzymes disclosed can be purchased 

from Bethesda Research Laboratories, Gai thersburg , HD , 
or New England Bio labs , Beverly, MA. The enzymes are 
used according to the instructions provided by the 
supplier . 

25 The various methods employed in the preparation- 

of the plasmids and transformation of host organisms 
are well known in the art. These procedures are all 
described in Maniatis , T. , Fritsch, E.F., and Sarnbrook, 
J. (1982) Molecular Cloning: A Laboratory Manual , 
30 Cold Spring Harbor Laboratory, New York. Thus, it 

is within the skill of these in the genetic engineering 
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arc to extract DNA from microbial cells , perform 
restriction enzyme digestions, electrophorese DNA 
fragments, tail and anneal plasmid and insert DNA, 
ligate DNA, transform cells, e.g., E. coli cells, 
prepare piasmid DNA, electrophorese proteins, ana 
sequence DNA. 

Immunochemical assays employing the K7LV-III 
proteins of the invention can take a variety of forms. 
The preferred type is a solid phase immunotaetric assay. 
In assays of this type, an HTLV-III protein is immobilized 
on a solid phase to form an antigen- irsmunoadsorbent . 
The immunoadsorbent is incubated with the sample to be 
tested. After an appropriate incubation period, the 
immunoadsorbent is separated from the sample and 
labeled ant i- (human IgG) antibody is used to detect 
human anti-HTLV-III antibody bound zc the issauncadsor- 
bent. The amount of label associated with the iizsnuno- 
adsorbent can be compared to positive and negative 
controls to assess the oresence or absence of anti- 
HTLV-III antibody. 

The itnrriunoadsorbent can be prepared by adsorbing 
or coupling a purified HTLV-III protein to a solid 
phase. Various solid phases can be used, such as 
beads formed of glass, polystyrene, polypropylene, 
dextran or other material. Other suitable solid phases 
include tubes or plates formed from or coated with - 
these materials. 

The KXLV-III proteins can be either covalently or 
non-covalently bound to the solid, phase by techniques 
such as c ova lent bonding via an amice or ester linkage 
or adsorption. After the HTLV-III protein is affixed 
to the solid phase, the solid phase can be post-coated 
with an animal protein, e.g., 3% fish gelatin. This 
provides a blocking protein which reduces nonspecific 
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adsorption of protein to the immunoadsorbent: surface. 

The immunoadsorbent is then incubated with the 
sample to be tested for anti-KTLV-III antibody* In 
blood screening, blood plasma or serum is used. The 
plasma or serum is diluted with normal animal plasma 
or serum. The diluent plasma or serum is derived 
from the same animal, species that is the source of 
the anti- (human IgG) antibody. The preferred anti- 
( human IgG) antibody is goat anti- (human IgG) antibody. 
Thus, in the preferred format, the diluent would be 
goat serum or plasma. 

The conditions of incubation, e.g., pK and tempera- 
ture, and the duration of incubation are not crucial. 
These parameters can be optimized by routine, experimen- 
tation. Generally, the incubation will be run for 1-2 
hr at about 45°C in a buffer of pH 7-8- 

After incubation, the immunoadsorbent and the 
samole are separated. Separation can be accomplished 
by any conventional separation technique such as 
sedimentation or centrif u^aticn. The immunoadsorbent 
then may be washed free of samole to eliminate any 
interfering substances . 

The immunoadsorbent is incubated with the labeled 
anti- (human IgG) antibody (tracer) to detect human 
antibody bound thereto. Generally the immunoadsorbent 
is incubated with a solution of the labeled anti- (human 
IgG) antibody which contains a small amount (about 1%) 
of the serum or plasma of the animal species which 
serves as the source of the anti- (human IgG) antibody. 
Anti- (human IgG) antibody can be obtained from any 
animal source. However, goat anti- (human IgG) antibody 
is preferred. The anti- (human IgG) antibody can be an 
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antibody against the Fc fragment of human IgG , for 
example, goat anti- (human IgG) Fc antibody. 

The anti- (human IgG) antibody or anti- (human 
IgG) r c can be labeled with a radioactive material 
sucn as " iodxnej labeled with an optical label, such 
as a fluorescent material; or labeled with an enzyme 
such as horseradish peroxidase. The anti-human antibody 
can also be biotinylated and labeled avicin used to 
detect its binding to the immune adsorbent . 

After incubation with the labeled 'antibody , the 
immunoadsorbent is separated from the solution and 
the label associated with the immunoadsorbent is 
evaluated. Depending upon the choice of label, the 
evaluation can be done in a variety of ways- The label 
may be detected by a gamma counter if the label is a 
radioactive gamma emitter, or by a fiuorimeter, if the 
label is a fluorescent material. In the case of an 
enzyme, label detection may be done coicrimetrically 
employing a substrate for the enzyme. 

The amount of label associated with the imrnuno- 
adsorbent is compared with positive and negative 
controls in order to determine the presence cf anti- 
HTLV-III antibody. The controls are generally run 
concomitantly with the sample to be tested. A positive 
control is a serum containing antibody against HTLV-III 
a negative control is a serum from healthy individuals 
which does not contain antibody against HTLV-III. 

For convenience and standardization, reagents 
for the performance of the immunometric assay can be 
assembled in assay kits- A kit for screening blood, 
for example, can include: 

(a) an imrnunoadsorbent, e . ^. , a polystyrene bead 
coated with anHTLV-III protein; 
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Co) a diluent for the serum or plasma sample, 
e-g-, normal goat serum or plasma; 

(c) an anti-(human IgG) antibody, e.?,, goat 
anti-(hunian IgG) antibody in buffered, aqueous 
solution containing about 1% p T oat serum or 
plasma; 

(d) a positive control, e.g., serum containing 1 
antibody against at least one of the novel 
HTLV-III proteins; and 

(e) a negative control, e.g., pooled sera from 
healthy individuals which does hot contain 
antibody against at least one of the novel 
HTLV-III proteins.. 

If the label is an enzyme, an additional element 
of the kit can be the substrate for the enzyme. 

Another type of assay for ant i -HTLV-III antibody 
is an antigen sandwich assay. In this assay, a labeled 
HTLV-III protein is used in place of anti- (human IgG) 
antibody to detect anti-HTLV-III antibody bound to the 
iinmunoadsorbent . The assay is based in principle on the 
bivaiency of antibody molecules. One binding site of 
the antibody binds the antigen affixed to the solid 
phase; the second is available for binding the labeled 
antigen. The assay procedure is essentially the same 
as described for the imaunometric assay except that 
^Ster incubation with the sample, the intsunoadsorbent 
is incubated with a solution of labeled HTLV-III protein. 
HTLV-III proteins can be labeled with radioisotope, 
an enzyme, etc, for this type of assay. 

in a third format, the bacterial protein, protein A, 
which binds the Fc segment of an IgG molecule without 
interfering with the antigen- antibody interaction can 
be used as the labeled tracer to detect anti-HTLV- 
antihody adsorbed to the imrnunoadscrbent . Protein A 
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can be readily labeled with. a radioisotope, enzyme or 
other detectable species. 

Immunochemical assays employing an HTLV-III 
protein have several advantages over those employing 
a whole (or disrupted) virus. Assays based upon an 
HTLV-III protein will alleviate the concern over growing 
large quantities of infectious virus and the inherent 
variability associated with cell culturing and virus 
production. Further, the assay will help mitigate the 
real or perceived fear of contracting AIDS by technicians 
in hospitals, clinics and blood banks who perform the 
test . 

Vaccines comprising one or more of the HTLV-III 
proteins, disclosed herein, and variants thereof having 
antigenic properties, can be prepared by procedures well 
known in the art. For example, such vaccines can be prepared 
as injectables, e.g., liquid solutions or suspensions. Solid 
forms for solution in, or suspension in, a liquid prior to^ 
injection 5 also can be prepared. Optionally, the 
preparation also can be emulsified. The active anti- 
genic ingredient or ingredients can be mixed with excipi- 
ents which are pharmaceuticaily acceptable and compatible 
winh the active ingredient. Examples of suitable 
excipients are water, saline, dextrose, glycerol, 
ethanol, or the like, and combinations thereof. In 
addition, if desired, the vaccine can contain minor 
amounts of auxiliary substances such as wetting or 
emulsifying agents, pH buffering agents, or adjuvants, 
which enhance the effectiveness of the vaccine. The 
vaccines are conventionally administered parenterally , 
by injection, for example, either subcutaneous ly or 
intramuscularly. Additional formulations which are 
suitable for other modes of administration include 
suppositories and, in some cases, oral formulations. 
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For suppositories, traditional binders and carriers 
include, for example, polyalkalene glycols or trigly- 
cerides. Suppositories can be formed from mixtures 
containing the active ingredient in the range of about 
0.5% to about 10%, preferably about 1 to about 2%. 
Oral formulations can include such normally employed 
excipients as, for example, pharmaceutical grades of' 
manitol, lactose, starch, magnesium stearate, sodium 
saccharine, cellulose, magnesium carbonate and the 
like. These compositions can take the form of solu- 
tions, suspensions, tablets, pills, capsules, sustained 
release formulations or powders and contain from about 
10% to about 95% of active ingredient, preferably from 
about 25% to about 70%. 

The proteins can be formulated into the vaccine 
as neutral or salt forms. Pharmaceutically acceptable 
salts include the acid addition salts (formed with the 
free amino groups of the peptide) and which are formed 
with inorganic acids such as , for example , hydrochloric 
or phosphoric acids, or such organic acids as acetic, 
oxalic, tartaric, mandelic , and the like. Salts 
formed with the free carboxyl groups also can be 
derived from inorganic bases such as , for example , 
sodium, potassium, ammonium, calcium, or ferric hy- 
droxides, and such organic bases as isopropylamine , 
trimethylamine , 2-ethylamino ethanol, histidine, pro- 
caine, and the like. 

The vaccines are administered in a manner compati- 
ble with the dosage formulation, and in such amount as 
will be therapeutically effective and immunogenic. The 
quantity to be administered depends on the subject to 
be treated, capacity of the subject's immune system to 
synthesize antibodies, and the degree of protection 
desired. Precise amounts of active ingredient required 
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to be administered depend on the judgment of the 
practitioner and are peculiar to each individual. 
However, .suitable dosage ranges are of the order of 
about several hundred micrograms active ingredient 
per individual. Suitable regimes for initial admini- 
stration and booster shots are also variable, but are 
typified by an initial administration followed in one 
or two week intervals by a subsequent injection or 
other administration . 

HTLV-III is known to undergo amino acid sequence 
variation, particularly in the envelope gene (Starcich, 
B..R. [1986] Cell 45:637-648; Hahn, B-H- et al. [1986] 
Science 232:1548-1553). Over 100 variants have been 
analyzed by molecular cloning and restriction enzyme 
recognition analysis, and several of these have been 
analyzed by nucleotide sequencing. Some of these are 
the HTLV-III isolates known as RF (Popovic, M. et al. 
[1984] Science 224:497-500), UMJ-1 (Hahn, B.H. et al. 
[1986] Science 232:1548-1553), LAV (Wain-Hobson, S. 
et al. [1985] Cell 40:9-17), and ARV-2 (Sanchez- 
Pescador, R. et al. [1985] Science 227:484-492). 
Although the subject invention describes the sequence 
from one HTLV-III isolate, the appropriate envelope 
regions of any HTLV-III isolate can be produced using 
the procedures described herein for preparing R10, 
FBI, 590, and KHI ♦ The HTLV-III proteins from 
different viral isolates can be used in vaccine 
preparations , as disclosed above , to protect 
against infections by different HTLV-III 
isolates. Further, a vaccine preparation can be 
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made using more than one recombinant antigenic protein 
from mors than one HTLV-III isolate to provide 
immunity and thus give better protection against 
AIDS. 

Following are examples which illustrate the 
process of the invention, including the best mode. 
These examples should not be construed as limiting. 
All solvent mixture proportions are by volume unless 
otherwise noted. 

Example 1—Construction of plasmid pREV2 . 2 

The pREV2.2 plasmid expression vector was con- 
structed from plasmid pBGl. Plasmid pBGl can be 
isolated from its E. coli host by well known proce- 
dures , e,.g., using cleared lysate-isopycnic density 
gradient procedures, and the like. Like~pBGl, pREV2.2 
expresses inserted genes behind the E- coli promoter. 
The differences between pBGl and pR£V2. 2 are the 
following: 

1. pREV2.2 lacks a functional replication of 
plasmid ( top ) protein. 

2. pREV2.2 has the trp A transcription terminator 
inserted into the Aatll site. This sequence 
insures transcription termination of over- 
expressed genes. 
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3. pREV2.2 has genes to provide resistance to 
ampicillin and chloramphenicol, whereas pBGl 
provides resistance only to ampicillin. 

4. pREV2.2 contains a sequence encoding sites for 
several restriction endonuc leases . 

The following procedures , shown in Figure 1 of 
the drawings , were used to make each of the four 
changes listed above: 

la. 5 pg of plasmid pBGl was restricted with Ndel 
which gives two fragments of approximately 
2160 and 3440 base pairs* 

lb. 0.1 yg of DMA from the. digestion mixture > after 
inactivation of the Ndel, was treated with 
T4 DNA ligase under conditions that favor 
intramolecular ligation (200 pi reaction 
volume using standard T4 ligase reaction 
conditions [New England Biolabs, Beverly, 
MA]). Intramolecular ligation of the 3440 
.base pair fragment gave an ampicillin resistant 
plasmid. The ligation mixture was transformed 
into the recipient strain E. coli JM103 
(available from New England Bio labs) and 
ampicillin resistant clones were selected 
by standard procedures. 

1c. The product plasmid, pBGlAN, where the 2160 
base pair Nde l fragment is deleted from pBGl, 
was selected by preparing plasmid from 
ampicillin resistant clones and determining 
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the restriction digestion patterns with Ndel 
and Sai l (product fragments approximately 17.90 
and 1650). This deletion iuaetivates the rop 
gene that controls plasmid replication. 

2a. 5 yg of pBGlAN was then digested with EcoRI and 
Bel l and the larger- fragment , approximately 2455 
base pairs, was isolated. 

2b. A synthetic double stranded fragment was prepared 
by the procedure of Itakura et al. (Itakura, K. , 
Rossi, J.J. and Wallace, R.B. [1984]- Ann. Rev. 
Biochem. 53:323-356, and references therein) with 
the structure shown in Table 1. This fragment 
has Bel l and EcoRI sticky ends and contains recog- 
nition sequences for several restriction endonu- 
cleases. 

2c. 0.1 of the 2455 base pair EcoR I- Bcl I fragment 
and 0.01 ug of the synthetic fragment were joined 
with T4 DNA ligase and competent cells of strain 
JM103 were transformed. Cells harboring the 
recombinant plasmid, where the synthetic fragment 
was inserted into pBGlAN between the Bel l and EcoR I 
sites, were selected by digestion of the plasmid 
with Hpa l and EcoR I . The diagnostic fragment sizes 
are approximately 2355 and 200 base pairs. This 
plasmid is called pREVl. 

2d. 5 ug of pREVl were digested with Aatll, which 
cleaves uniquely. 

2e. The double stranded fragment shown in Table 2 was 
synthesized. This fragment has Aat ll sticky ends 
and contains the troA transcription termination 
sequence. 

2f. 0.1 yg of Aat ll digested pREVl was ligated with 
0.01 pg of the synthetic fragment in a volume 
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of 20 yl using T4 DMA ligase. 

Cells or strain JM103, made competent:, were trans- 
formed and ampicillin resistant clones selected. 
Using a Kpn l , Eco RI double restriction digest 
of plasmid isolated from selected colonies, 
a cell containing the correct construction 
was isolated. The sizes of the Kpnl , Eco RI 
generated fragments are approximately 24 75 and 
80 base pairs. This plasmid is. called p REV ITT 
and contains the trp A transcription terminator. 

5 ug of pREVITT, prepared as disclosed above 
(by standard methods) was cleaved with Nde l 
and XmnI and the approximately 850 base pair 
fragment was isolated, 

5 pg of plasmid pBR325 (3RL, Gaithers- 
burg, MD) , which contains the genes conferring 
resistance to chloramphenicol as well as to 
ampicillin and tetracycline, was cleaved with 
Bel l and the ends blunted with Klenow polymerase 
and deoxynucieo tides . After inactivating the 
enzyme , the mixture was treated with Ndel and 
the approximately 3185 base pair fragment 
was isolated. This fragment contains the 
genes for chloramphenicol and ampicillin 
resistance and the origin of replication. 
0.1 yg of the Hde l- Xmn l fragment from pREVITT 
and the Nde l- Bcl l fragment from pBR325 were 
ligated in 20 yl with T4 UNA iigase and the 
mixture used to transform competent cells of 
strain JM103. Ceils resistant to both ampi- 
cillin and chloramphenicol were selected. 
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Using an EcoRI and Ndel double digest of 
plasmid from selected clones, a plasmid was 
selected giving fragment sizes of approxi- 
mately 2480, 1145, and 410 base pairs. This 
is called plasmid pREVlTT/chl and has genes 
for resistance to both ampicillin and chloram- 
phenicol. 

A double stranded fragment shown in Table 3 
was synthesized. This fragment, with a blunt 
end and an Ss t l sticky end, contains recogni- 
tion sequences for several restriction enzyme 
sites. 

5 ug of pREVlTT/chl was cleaved with Nrul 
(which cleaves about 20 nucleotides from the 
Bel l site) and Sst I (which cleaves within the 
multiple cloning site). The larger fragment, 
approximately 3990 base pairs , was isolated 
from an agarose gel. 

0.1 pg of the Nru l -SstI fragment from pREVlTT/chl 
and 0.01 yg of the synthetic fragment were 
treated with T4 DiuA ligase in a volume of 
20 M l. 

This mixture was transformed into strain JM1Q3 
and ampiciliin resistant clones were selected. 
Plasmid was purified from several clones and 
screened. by digestion with Mlul or Clal. 
Recombinant clones with the new multiple cloning 
site will give one fragment when digested wich 
either of these enzymes , because each cleaves 
the plasmid once . 

The sequence of the multiple cloning site was 
verified. This was done by restricting the 
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plasmid with Hpa l and PvuII and isolating 
the 1395 base pair fragment, cloning it into 
the Sflia l site of mpl8 and sequencing it by 
dideoxynucleotide sequencing using standard 
methods. 

4g. This plasmid, called pR£V2.2 is diagrammed 
in Figure 2 of the drawings. 

Example 2- -Construction of and expression from pRiQ 

Plasmid pRIO , which contains approximately 1275 
base pairs of DNA encoding the HTLV-II1 env gene from 
essentially the Kpn l site to the Bgl ll site, and from 
which is synthesized an approximately 95 kD fusion 
protein containing this portion of the gp!20 envelope 
protein, can be constructed as follows: 

1. Synthesizing the DNA with the sequence shown 
in Table 4. This DNA fragment can be synthe- 
sized by standard methods (Itakura, et aL , 
supra, and references therein) and encodes 

a portion of gpl20 . It has a blunt end on the 
5 ? end and an end which will ligate with a 
BamHI overhang on the 3 f end. 

2. Restricting 5 ug of plasniid pBGl with Bel l , 
filling in the overhanging ends with Klenow 
polymerase and deoxyr ibonuc leo tide triphos- 
phates (dNTPs) , restricting this fragment 
with Bam HI and isolating the large fragment , 
approximately 8.9 kb, from an agarose gel. 

3. Ligating 0.1 ug of the fragment in Table A i*ith 
0.1 ug of the pBGl fragment in a volume of- 

20 \il using TU DNA iigase , transforming the 
ligation mixture into competent cell strain 
SG20251 (Gottesman , S. 5 Halpem , E. and Trisier 
P. [1981] Journal of Bacteriology 148:265-273), 
and selecting ampicillin resistant transformant 
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4. Selecting, using the Ahalll restriction 
pattern of purified plasmid, cells harboring the 
recombinant plasmid with the synthesized 
fragment in the orientation whereby the frag- 
ment blunt end ligated to the pBGl fragment 
filled- in Bel l end and the Bam HI overhanging 
ends ligated together. Aha lll digestion of the 
proper plasmid gives fragment lengths of 
approximately 5300, 3170, 690, 640, 330, and 

20 base pairs . 

5. When the strain harboring this recombinant plas- 
mid is grown in 2% medium (2% yeast extract , 
bactotryptone , casamino acids (Difco, Detroit, 
KI) , 0.2% potassium monobasic, 0.2% potassium 
dibasic, and 0.2% sodium dibasic) containing 

50 yg/ml ampiciliin and the total complement 
of cellular proteins electrophoresed on an 
SDS-polyacrylamide gel, a prominent protein of 
approximately 95 kD can be visualized by either 
coomassie blue staining or by western blot 
analysis using as probe selected sera from 
AIDS ♦ ARC, or HTLV-III infected individuals. 

Example 3--?urif icaticn of recombinant orotein containing 
HTLV^III envelope sequences from plasmid pRIQ 

1. Growth of cells: 

Cells were grown in a 10 liter volume in a Chemap 
fermentor (Chemapec, Woodbury, NY) in 2% medium. Fer- 
mentation temperature was 37 °C , the pK vas 6.8, and air 
was provided at 1 wm. Plasmid selection was provided 
by 50 jjg/nil ampiciliin. Typical cell yield (wet weight) 
is 30 g/1. 

2. Cell lysis : 

50 g, -wet ceil weight, of £. coii containing the 
recombinant HTLV-III envelope fusion protein were 



-25-- 



R149-C2 



resuspended in a final volume of 100 ml in 50 rsM 
Tris-Cl pH 8.0, 5 mM ethylenediaminetetraacetic 
acid (EDTA) , 5 mM dithiothreitol (DTT) > 15 
niM B-mercaptoethanol, 0.5% TRITON^ X-100, and 5 aM 
phenylmethylsulfonyl fluoride (PMSF) . 300 mg lysozyme 
was added and Che suspension incubated for 30 min at 
room temperature. 

This material was lysed using a BEAD -BEATER™ 
(Biospec Products, Bartlesville , OK) containing an 
equal volume of 0*1-0.15 urn glass beads-. The lysis 
was done for 6 min at room temperature in 1 min 
intervals . The liquid was separated from the beads and 
centrifuged for 2.5 hr at 20,000 xg. The supernatant 
was removed and the pellet dissolved in 100 ml 8 M 
urea, 20 mM Tris-Cl pR 8.0, 5 xnM DTT , 15 inI4 S-mercap- 
toethanol, 5 inM PMSF, and 1 mM EDTA. The pellet was 
solubilized using a polyrron homogenizer (Beckman , 
Berkeley, CA) and centrifuged at 20,000 xg for 2 hr. 
3- Diethylaminoethyl (DEAE) chromatography: 

Supernatant was loaded onto a 550 mi column 
(5 cm x 28 cm) packed with DEAE Fast Flow SEPHAROSE® 
(Pharmacia, Piscataway, Nj) equilibrated in 8 H urea, 
20 mM Tris-Cl pH 8.0, 15 mM g-mercaptoethsnol , and 
1 mH KEDTA at room temperature. The column was washed 
with 1.5 liters equilibration buffer, and the protein 
eluted with a 5.0 liter linear gradient from 0-0.8 H 
NaCl in equilibration buffer. The HTLV-IIX protein 
eluted at 0.2 M NaCl and was assayed using SDS-poly- 
acryiamide electrophoresis and following the prominent 
protein at approximately 95 kD. 

The fractions containing the HTLV-III protein 
were pooled and the protein concentrated to 10 ml 
using a stressed cell positive pressure concentrator 
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(Amicon, Danvers , MA) fitted vith a 10,000 MW cut-off 
membrane (YM-10 , Amicon) . The concentrate was loaded 
onto a 500 ml column (2.5 cm x 100 cm) packed with 
superfine sephacryl S-300 (Pharmacia) equilibrated in 
8 M urea, 20 mM Tris-Cl, pH 8.0, 15 mM B-mercaptoethanol , 
and 1 mM EDTA. The column was eluted with equilibra- 
tion buffer at room temperature. A flow rate of 0.5 
ml/min was maintained. The HTLV-III protein eluted at 
approximately 40% of the column volume. 

Example 4- -Construction of and expression from ulasmid 



Plasinid pPBl , which contains approximately 540 
base pairs of DNA encoding essentially the HTLV-III env 
gene from the PvuII site to the Bgl ll site, and from 
which is synthesized an approximately 26 kD fusion 
protein containing this portion of the gp!20 envelope 
protein can be constructed as follows: 

1. Synthesizing the DNA with the sequence shown in- 
Table 15: This DMA fragment can be synthesized by 
standard methods and encodes a portion of gp!20. 
It has a blunt end otx the 5' end and an end which 
will ligate with a Bamhl overhang on the 3' end. 

2. Restricting 5 yg plasmid pR£V2.2 with ScoRV and 
BamHI and isolating the large fragment , approximately 
4 kD, from an agarose gel. 

3. Ligating 0.1 ug of the fragment in Table 15 with 0.1 
yg of the pREV2.2 fragment in a volume of 20 yl 
using T4 DNA ligase , transforming the ligation mix- 
ture into competent cell strain SG20251 5 and selecting 
ampicillin resistant trans f onnants . 

4. Using the Ahalll restriction pattern of purified 
plasmid, selecting cells harboring the recombinant 
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plasmid with the synthesized fragment in the 
orientation whereby the fragment blunt end ligated 
to the REV2.2 EcoR V end and the BamH I overhanging ends 
ligated together. Ahalll digestion of the proper 
plasmid gives fragment lengths of approximately 
1210, 1020, 750, 690, 500, 340, and 20 base pairs . 
When the strain harboring this recombinant plasmid 
is grown in 2% medium containing 50 Mg/ml ampicillin 
and the total complement of cellular proteins 
electrophoresed on an SDS -poly aery lamide gel, a 
protein of approximately 26 kD can be- visualized 
by either coomassie blue staining or by western 
blot analysis using as probe selected sera from 
AIDS, ARC , or HTLV-III infected individuals . 

Example 5 — Purif ication o£ recombinant protein con - 
taining HTLV- III, envelope sequences from plasmid oPS ^ xiIB 

1. Growth of cells: 

Cells were grown in a 10 liter volume in a Chemap 
fermentor' in 2% medium. Fermentation temperature was 
37 °C, the pH was 6.8, and air was provided at 1 wis. 
Plasmid selection was provided by 50 ug/ml ampicillin 
and 20 ug/ml chloramphenicol. Typical cell yield 
(wet weight) was 30 g/1. 

2. Cell Lysis: 

50 g, wet cell weight, of E. coii containing the 
recombinant HTLV- III envelope fusion protein were 
resuspended in a final volume of 100 ml in 50 mM 
Tris-Cl pH 8.0, 5 mM EDTA, 5 mM DTT, 15 mM 8-niercap- 
toethanol, 0.5Z TRITON®X-100 , and 5 mM ?MSF. 300 mg 
lysozyme was added and the suspension incubated for 30 
min at room temoerature- 

This material was lysed using a BEAD - BEATER" 
(Biospec Products, Bartlesville , OK) containing an 
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equal volume of 0.1-0.15 urn glass beads. The lysis 
was done for 6 tain at room temperature in 1 rain 
intervals- The liquid was separated from the beads 
and centrifuged for 2.5 hr at 20,000 xg. The super- 
natant was removed and the pellet was resuspended in 
100 ml 6 M guanidine-hydrochloride, 20 mM Tris-Cl pH 
8.0, 5 TtiM DTT, 15 mM 8-mercaptoethanol , 5 mM PMSF, and 

1 mM EDTA. The pellet was solubilized using a 
polytron homogenizer and centrifuged at 20,000 xg 
for 2 hr. 

The supernate (90 ml) was dialysed against 4 liters of 
8 M urea, 20 mM potassium phosphate, pH 7.0, 1 mM EDTA, 
and 15 mM B-mercaptoethanol. Dialysis was done each time 
for 8 hr or longer with three changes of buffer. 
Spectraphor dialysis tubing (S/P, McGraw Park, XL) 
with a 3.5 kD MW cut-off was used. 
3. CM chromatography 

The dialysate was loaded onto a 550 ml column 
(5 cm- x 28 cm) packed wirh CM Fast Flow SEPHAROSE® 
(Pharmacia) equilibrated in 8 M urea, 10 mM potassium 
phosphate pH 7.0, 15 mM g-mercaptoethanol , and 1 mM 
EDTA at room temperature. The column was washed with 

2 liters equilibration buffer, and the protein e luted 
with a 5,0 liter linear gradient from 0-0.4 M NaCl- 

. The HTLV-III protein (26 kD) eiuted at approximately 
0.2 M NaCI as assayed by SDS -poly aery lamide gel 
electrophoresis - 

Example 6 --Cons true t ion of and expression from plasmid 
p590 

Plasmid p590 , which contains approximately 1055 
base pairs of DNA encoding essentially the HTLV-III 
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env gene from the PvuII site to the Hindi II site, and 
from which is synthesized an approximately 86 kD 
fusion protein containing this portion of the gp!60 
envelope protein can be constructed as follows: 

1. Synthesizing the DNA with the sequence shown in 
Table 6: This DNA fragment can. be synthesized by 
standard methods and encodes a portion of gpl60. 
It has a blunt end on the 5' end and an end which 
will ligate with a Hindlll overhang on the 3' end. 

2. Restricting 5 ug plasmid pREV2 . 2 with EcoR V and 
Hindi I I and isolating the large fragment, approxi- 
mately A kD, from an agarose gel. 

3. Ligating 0.1 ug of the fragment in Table 6 with 
0.1 ug of the pREV2.2 fragment in a volume of 20 
ml using T4 DNA ligase , transforming the ligation 
mixture into competent cell strain SG20251, and 
selecting ampicillin resistant trans fonnants . 

4. Using the Ahalll restriction pattern of purified 
plasmid, selectiong cells harboring the recombinant 
plasmid with the synthesized fragment in the ori- 
entation whereby the fragment blunt end ligated to 
the pREV2-2 EcoR V end and the Hindi I I overhanging 
ends ligated together; Ahalll digestion of the 
proper plasmid gives fragment lengths of approxi- 
mately 1740, 1020, 750, 690, 500, 340, and 20. 

5. 5 ug of plasmid, purified from this strain, is. 
restricted with Nde l and Sma l . The approximately 
1425 base pair fragment is isolated from an agarose 
gel. The 1505 base pair fragment is fused to the 
DNA encoding the segment of gp!60. 

6. 5 ug of pBGlOl is restricted with BamHI , the over- 
hanging ends filled in with Klenow polymerase and 
dNT?s, and then restricted wich Ndel. The approxi- 
mately 6.5 kD fragment is isolated from an agarose 
gel. 
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7. Ligating 0.1 pg of the Ndel-Smal fragment with 0.1 
Ug of the pBGl fragment using T4 DNA ligase , trans- 
forming the ligation mixture into competent cell 
strain SG20251, and selecting ampicillin resistant 
transfonnants. 

8. Using the Ahalll restriction pattern of purified 
plasmid, selecting cells harboring the recombinant 
plasmid with the synthesized fragment in the ori- 
entation whereby the fragment blunt Sma l end ligated 
to the BamHI/ filled- in end and the Nde l overhanging 
ends ligated together. Ahalll digestion of the 
proper plasmid gives fragment lengths of approxi- 
mately 5900, 1020, 690, 430, and 20 base pairs. 

9. When the strain harboring this recombinant plasmid 
is grown in 2% medium containing 50 ug/ml ampicillin 
and the total complement of cellular proteins 
electrophoresed on an SDS-polyacrylamide gel, a 
protein of approximately 86 kD can be visualized 

by either coomassie blue staining or by western blot 
analysis using as probe selected sera from AIDS, ARC, 
or HTLV-III infected Individuals. 

Example 7 --Purification of recombinant protein con - 
taining HTLV-III envelope sequences from plasmid t>5 90 

1. Growth of cells: 

Cells were grown in a 10 liter volume in a Chemap 
fermentor in 2Z medium. Fermentation temperature was 
37 °C, the pH was 6.8, and air was provided at 1 wm. 
Piasmid selection was provided by 50 pg/ml ampicillin. 
Typical cell yield (wet weight) is 30 g/1. 

2, Cell Lysis: 

50 g, wet cell weight, of E. coli containing the 
recombinant HTLV-III envelope fusion protein were 
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resuspended in a final volume of 100 ml in 50 mM 
Tris-Cl pH 8.0, 5 mM EDTA, 5 mM DtT, 15 mM B-mercapto- 
ethanol. 0.5Z TRITON^X-100 , and 5 mM PMSF. 300 mg 
lysozyme was added and Che suspension incubated for 30 
min at room temperature. 

TM 

This material was lysed using a Bead-Beater 
containing 0.1-0.15 mm glass beads. The lysis was 
done for 6 min at room temperature in 1 min intervals. 
The liquid was separated from the beads and centrifuged 
for 2.5 hr at 20,000 xg. The supernatant was removed 
and the pellet was resuspended in 100 ml 6 M guanidine- 
hydrochloride, 20 mM Tris-Cl pH 8.0, 5 mM DTT, 15 mM 
B-mercaptoethanol , 5 mM PMSF, and 1 mM EDTA. The 
pellet was solubilized using a polytron homogenizer and 
centrifuged at 20,000 xg for 2 hr. 

The supernate (90 ml) was dialysed against k liters 
of 8 M urea, 20 mM Tris-Cl, pH 8.0, 1 mM EDTA, and 15 
mM 8-mercaptoethanol . Dialysis. was done each time for. 
8 hr 'or longer with three changes of buffer. 
3. Diethylaminoethyl (DEAE) chromatography 

Dialysate was loaded onto a 550 ml column (5 cm 
x 28 cm) packed with DEAE Fast Flow SEPRAROSE^ (Pharm- 
macia) equilibrated in 8 M urea, 20 mM Tris-Cl pH 8.0, 
15 mM B-mercaptoethanol , and 1 mM EDTA at room tem- 
perature. The column was washed with 1.5 liters 
equilibration buffer, and the protein eluted with a 
5.0 liter linear gradient from 0-0.8 M NaCl in equili- 
bration buff er . The HTLV-III protein eluted at 0.4 M 
NaCl and was assayed using SDS-polyacrylamide electro- 
phoresis and following the prominent protein at 
approximately 86 kD. 
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The fractions containing the HTLV-III protein 
were pooled and the protein concentrated to 10 ml using 
a stressed cell positive pressure concentrator (Amicon) 
fitted with a 10,000 MW cut-off membrane (YM-10, 
Amicon) . .The concentrate was loaded onto a 500 ml 
column (2.5 cm x 100 cm) packed with superfine 
SEPHACRYL®S-300 (Pharmacia) equilibrated in 8 M urea, 
20 mM Tris-Cl, pH 8.0, 15 mM 8-mercaptoethanol , and 
1 mM EDTA. The column was eluted with equilibration 
buffer at room temperature. A flow rate, of 0.5 ml /rain 
was maintained. The HTLV-III protein eluted at 
approximately 40% of the column volume. 

Example 8- -Construction of and expression from plasmid 
pKHl 

Plasmid pKHl, which contains approximately 1830 
base pairs of DNA encoding essentially the HTLV-III 
env gene from the Kpn l site to the Hindlll site, and 
from which is synthesized an approximately 70 kD 
fusion protein containing this portion of the gpl60 
envelope protein, can be constructed as follows: 

1. Synthesizing the DNA with the sequence shown 
in Table. 7: This DNA fragment can be synthe- 
sized by standard methods and encodes a portion 
of gpl60. It has a blunt end on the 5' end 
and an end which will ligate with a Hin di II 
overhang on the 3' end. 

2. Restricting 5 ug plasmid pREV2.2 with Mlul , 
treating the DNA with Klenow polymerase to 
blunt the ends, treating with Hin di I I and 
isolating the large fragment , approximately 
5 kD, from an agarose gel. 

3. Ligating 0.1 yg of the fragment in Table 7 
with 0.1 yg of the pREV 2.2 fragment in a 
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volume of 20 ul using T4 DNA ligase , trans- 
forming the ligation mixture into competent 
cell strain CAG629, and selecting ampicillin 
Resistant transf ormants . 
k. Using the Ahalll restriction pattern of puri- 
fied plasmid, selecting cells harboring the 
recombinant plasmid with the synthesized 
fragment in the orientation whereby the frag- 
ment blunt end li gated to the REV2.2 Mlul 
end and the Hin di 1 1 overhanging ends ligated 
together. Ahalll digestion of the proper 
plasmid gives fragment lengths of approximately 
1730, 1020, 750, 690, 640, 600, 340', and 20 
base pairs. When the strain harboring this 
recombinant plasmid is grown in 2% medium 
containing 50 pg/ml ampicillin and the total 
complement of cellular proteins electrophoresed 
on an SDS-polyacrylamide gel, a protein of 
approximately 70 kD can be visualized by either 
Coomassie blue staining or by Western Blot 
analysis using as probe selected sera from 
AIDS, ARC, or HTLV-III infected individuals. 

Example 9- -Purification of recombinant protein containing 
HTLV-III envelope sequences from plasmid pKHl 
1. Growth of cells: 

Cells were grown in a 10 liter volume in a Chemap 
fermenter in 2% medium- Fermentation temperature 
was 32°C, the pH was 6.8, and air was provided at 
1 win. Plasmid selection was provided by 50 ug/ml 
ampicillin. Typical cell yield (wet weight) is 30 

g/1- 
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2. Cell lysis: 

50 g, wet cell weight, of E. coli containing the 
recombinant HTLV-III envelope fusion protein were 
resuspended in a final volume of 100 ml in 50 mM 
Tris-Cl pH 8.0, S.xnM EDTA, 5 mM dithiothreitol (DTT), 
15 mM 0-mercaptoethanol, 0.5% TRITON*X-100 and 5 mM 
PMSF, 300 mg lysozyme was added and the suspension 
incubated for 30 min at room temperature. 

TM 

This material was ly.sed using a BEAD -BEATER 
(Biospec Products) containing an equal volume of 0.1- 
0.15 urn glass beads. The lysis was done for 6 min 
at room temperature in 1 min intervals . The liquid 
was separated from the beads and centrif uged for 2 . 5 
hr at 20,000 xg. The supernatant was removed and the 
pellet dissolved in 100 ml 8 M urea, 20 mM Tris-Cl 
pH 8.0, 5 mM DTT, 15 mM B-mercaptoethanol , 5 mM PMSF , 
and 1 mM EDTA., The pellet was solubilized using a 
polytron homogenizer (Beckman, Berkeley, CA) and 
centrif uged at 20,000 xg for 2 hr. 
3. DEAE chromatography: 

Supernatant was loaded onto a 550 ml column (5 cm 
x 28 cm) packed with DEAE Fast Flow SEPEAROSE® (Pharmacia) 
equilibrated in 8 M urea, 20 mM Tris-Cl pH 8.0, 15 
mM B-mercaptoethanol, and 1 mM EDTA at room tempera- 
ture. The column was washed with 1.5 liters equilibra- 
tion buffer, and the protein eluted with a 5.0 liter 
linear gradient from 0-0-8 M NaCl in equilibration 
buffer. The HTLV-III protein eluted at 0.2 M NaCl 
and was assayed using SDS-polyacrylamide electrophoresis 
and following the protein at approximately 70 kD. 

The fractions containing the HTLV-III protein 
were pooled and the protein concentrated to 10 ml 
using a stressed cell positive pressure concentrator 
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(Amicon) fitted with a 10,000 MW cut-off membrane 
(YM-10, Amicon). The concentrate was loaded onto a 
500 ml column (2.5 cm x 100 cm) packed with superfine 
SEPHACRYL®S-300 (Pharmacia) equilibrated in 8 M urea, 
20 mM Tris-Cl, pH 8.0, 15 mM 8-mercaptoethanol , and 
1 mM EDTA. The column was eluted with equilibration 
buffer at room temperature. A flow rate of 0.5 ml/min 
was maintained. The HTLV-III protein eluted at 
approximately 40% of the column volume. 
4. SDS-polyacrylamide electrophoresis: • 

The fractions containing KH1 were pooled and the 
protein concentrated using a stressed cell positive 
pressure concentrator fitted with a 10,000 MW cutoff 
membrane. 2 mg of protein was mixed with loading 
buffers and electrophoresed through a preparative 
SDS-polyacrylamide gel (40 cm x 20 cm x 4 mm) as 
described by M.W. Hunkapiiler, E. Lujan, F. Ostrander, 
and L.E. Hood, Methods in Enzymology 91:227-236 
(1983). The 70 kD HTLV-III protein was visualized 
with 0.25 M KC1 and eluted from the gel as described. 
The protein can be removed from the SDS by precipi- 
tation with acetone (Dynan, W.J. , Jendrisak, J. J. , 
Hager, D.A. and Burgess, R.R. [1981] J- Biol. Chem. 
256:5860-5865) . 

Example 10 --Cons true t ion of a non- fusion derivative 
of PB1 

A non-fusion derivative of the PBl protein containing 
no non-HTLV-III amino acids other than an N-tenninal 
methionine was constructed using oligonucleotide-directed 
site-specific mutagenesis (Inouye, S. and Inouye, M. , 
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"Synthesis & Applications of DNA & EHA'\ ed. Narang, Saran A. 
Academic Press, 1987). In this procedure, 90 non-HTLV-III bp 
upstream and 39 downstream of the env gene sequence in pPBl 
were dele ted via DNA loopouts generated by hybridization 
with synthetic oligonucleotides. 

The oligonucleotide synthesized for the N-tenninal 
loopout was designed so that the start ccdon of the 
B-glucuronidase gene is placed immediately adjacent to 
the 5' end of the HTLV-III env gene sequence (Figure 4). 
The oligonucleotide includes sequences homologous to 
both sides of this newly-created junction that allow 
proper hybridization to the plasmid DNA. 

The two DNA molecules used to form a heteroduulex 
with a single-stranded gap that is the substrate for 
hybridization were created by digesting pPBl with Sai l 
and Hpa l, or with Pst I alone. Digestion with Pst I 
linearized pPBl, and a double digest with Sai l and Hpa l 
yields fragments of 3800 and 700 bp, the larger of 
which was gel-isolated for use in the mutagenesis. 

Kinasing of the oligonucleotide, hybridization, 
polymerization and ligation to yield closed circular 
molecules weie done according to the methods of Inouye 
and Inouye mentioned above. To enrich for DNA molecules 
containing the deletion, the DNA mixture was digested 
with Mlul, which cuts within the region being deleted. 

The digested DNA was used to transform competent 
E . coli JK105 cells and plasmid- containing transformants 
were isolated by overnight growth on :YT (8 g tryptone, 
5 g yeast extract, and 5 g NaCl per liter) Cm plates 
at 37°C. 
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Plasmid DNA was isolated from each trans formant 
and screened for the correct construction by simultaneous 
digestion with M^ul and Hindlll. Molecules that were 
not deleted yielded fragments of approximately 3900 and 
600 bp. Those containing the deletion did not have 
the Mlul site and yielded linear molecules of approxi- 
mately 4400 bp. Plasmid DNA from transformants that 
appeared to contain the deletion was retransformed to 
ensure segregation of deleted and non-deleted plasmids 
and the recovery of pure plasmid populations. DNA 
from these second transformants was analyzed as in the 
previous digest and was determined to have the correct 
construction. This plasmid was designated p&PBl. 

To eliminate the C-terminal non-HTLV-III amino acids, 
oligonucleotide-directed site-specific mutagenesis was 
carried out as above, using the pAPBl plasmid as a 
substrate. The oligonucleotide (Figure 5) was designed 
to position the TGA codon that .occurs out -of -frame 
downstream from the env gene sequence so that it is 
immediately adjacent to the 3 1 end of the env gene sequence 
and in- frame to act as a trans lational stop codon. 

The molecules to form the heteroduplex used for 
hybridization were created by digesting pAPBl with Pst I 
alone or with Kpn l and Hpa l . The large Kpnl/Hoal fragment 
encompassing most of the vector was gel-isolated for 
use in the mutagenesis. Kinasing, hybridization, poly- 
merization and ligation were performed as above. 
Enrichment for deleted molecules was accomplished by 
digesting with Hindi II , which cuts within the region 
being deleted. The DNA was used to transform cells as 
above. 
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Plasmid DNA was isolated from each transformant 
and screened for the correct construction by simultaneous 
digestion with EcoR I and Hpa l. The deleted plasmid 
yields two restriction fragments of 2900 and 1750 bp. 
Plasmid DNA showing this pattern was retransformed as 
above, and DNA from these transf ormants was analyzed 
with the same digest. This plasmid, containing K-terminal 
and C- terminal deletions, is designated pd2?Bl. 

When the strain harboring plasmid pAPBl is grown in 
2% medium (2% yeast extract, bactotryptqne , casamino 
acids [Difco, Detroit, MI], 0.2% potassium monobasic , 
G.£% potassium dibasic, and 0.2% sodium dibasic) containing 
50 ug/ml amp ic ill in and the total complement of cellular 
proteins electrophoresed on an SDS^polyscrylamide gel, 
a protein of approximately 22 kD can be visualized by 
either coomassie blue staining or by western blot analysis 
using as probe selected sera from animals immunized with 
recombinant env gene proteins. Under the same conditions, 
a protein of approximately 20 kD is produced in a strain 
containing pd2P31. 

The technique of oligonucleo tide-directed site- 
specific mutagenesis can be used in a similar way to 
eliminate the non-HTLV-IIl amino acids flanking the env 
gene fusion proteins R10, 590, and KHl. 

In the procedure detailed above, the removal of 
the non-HTLV-III sequences from the fusion proteins 
involves removal of amino acids at both the N- terminus 
and the C- terminus of the protein and is accomplished 
in two sequential steps. 

It is well known in the art that a methionine at 
the terminal position can be enzymatically cleaved 
by the use of the enzyme methionine aminopeptidase (MAP) . 
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MAP has been cloned from E. coli (Ben-Bassat, A., Bauer, 
K. , Chang, S.-Y. , Myambo, K. , Boosman, A. and Chang, S. 
[1987} Journal of Bacteriology 169(2) : 751-757) and 
Salmonella typhimuriusu and in vitro activity has been 
demonstrated on recombinant proteins (Miller, C.G., 
Strauch, K-.L-, Kukral, A.M., Miller, J.L., Wingfield, 
P.T. , Massei, G.J., Werlen, R.C., Graber, P. and Mowa, 
N-R. [1987] Proc. Natl. Acad. Sci. USA 84:2718-2722). 
Therefore, removal of an N- terminal methionine may be 
achieved either in vivo by expressing the protein in a 
host which produces MAP (e.g., £. coli CMS 9 or S* 
cerevisiae) , or in vitro by use of purified MAP (e.g., 
procedure of Miller et al . ). 

pd2PBl Purification 

Unless specified otherwise, all steps are carried 
out at room temperature. 

Lysis — Three 700 ml bottles of frozen cell pasta containing 
pd2PBl are thawed at 37°C, and are then spun at 4,000 
rpm in a J-63 centrifuge with a JS-4.2 rotor (Beck-nan , 
Palo Alto, CA) at 4°C for 30 min. The supernatant 
is then discarded and the weight of the cell pellet is 
determined. The cell pellet (typically 1 kg) is resus- 
pended in 2 volumes of lysis buffer (v/w) which consists 
of 8 M urea, 20 inM Tris-KCl (pH 7.5 ± 0.1), 1 e££ £DTA, 
14.7 mM 2-mercaptoethanol and 1 xaM PMSF. 

The resuspended cell pellet is run through a Type 
TDK Pilot DYNO-MTLL° (Impandex Inc., Maywood, NJ) 
containing 0.5-0.7 ma glass beads at 200-400 mi/sin. 
Prior to use the DYN0-MILL @ is charged with one liter 
of lysis buffer and cooled so that the solution flowing 
through is at less than ambient temperature . The resus- 
pended cell pellet is passed through the DYNO-MILL^ twice 
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and after the second pass, the DYNO-MILL® is washed with 
1 liter of lysis buffer. Lysed cell suspension and wash 
are pooled. 

Concentration and filtration — The lysed cell suspension 
plus one liter wash is concentrated to 800 ml using a 
0.45 micron DURAPORE™ Pellicon cassette in a Pelii- 
con 4 GPM system (Millipore, Bedford, MA). The 
concentration is done with an inlet pressure of less 
than or equal to 40 psi and an outlet pressure between 
10 and 20 psi. After concentration the- lysed cell 
suspension is filtered with 4 liters of lysis buffer 
using the same Pellicon system, cassette and pressure 
settings with the tubing rigged for dyaf iltration. 
Extraction - -The washed lysis cell suspension is extracted 
with 10 1 of extraction buffer consisting of 6 M guanidine 
HC1, 100 mM Tris-HCl (pH 7.6 ± 0.1), and 10 nM EDTA, using 
the same Pellicon system, cassette and. pressure settings 
as described above with the tubing rigged for dyaf iltration. 
Buff er exchange — The filtrate from the previous sten is 
typically concentrated to 1 liter using a Pellicon 4G?M 
system with two PTGC cassettes (10,000 KMWL) . The con- 
centration is done with an inlet pressure of less than 
or equal to 50 psi and an outlet pressure between 30 
and 45 psi. After concentration, the supernatant is 
buffer exchanged with CM column buffer consisting of 
8 M urea, 25 nM potassium phosphate, and 1 inM EDTA 
(pH 6.8 x 0.1), with conductivity less than or equal to 
3.0 ms/cm. For buffer exchange, the same Pellicon system, 
the same cassettes and the same pressure settings as above 
are used with the tubing rigged for dyaf iltration. Eight 
liters of CM column buffer are used to buffer exchange I 
liter of concentrated extract. After buffer exchange, 
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the buffer -exchanged extract is drained from the system 
and the system is washed with 1 liter of CM column buffer. 
The buffer-exchanged extract and the wash are pooled and 
the solution's conductivity and pH are measured. The 
conductivity of the solution is adjusted to less than or 
equal to 3.0ms/cm with deionized 8 M urea and the pH is 
adjusted to be within the range of 6.5-7.0. 
CM chromatography - -A 50 x 51 cm column of CM SEPHAROSE® 
FAST FLOW (Pharmacia/ Fiscataway, NJ) is equilibrated 
by washing the column sequentially with 4 column volumes 
of 0.5 M NaOH, 2 column volumes of deionized water and 
2-3 column volumes of CM column buffer. The column is 
considered equilibrated when the pH of the outflow is 
within C.2 units of the CM column buffer and the conduc- 
tivity of the outflow is within 0,3 ms/cm of the CM 
column buffer. 

For loading, the buffer exchanged extract is pumped 
on to the column at an inlet pressure between 10 and 15 
psi. After loading, the CM column is washed with CM 
column buffer until the OD at 280 nm of the outflow 
is less than 0.1. The pd2PBl is then eluted with an 8- 
liter linear gradient of 0-0.5 M NaCl in CM column buffer 
and collected in 100 ml fractions. The fractions are 
assayed by SDS-PAGE and Western with anti-gpl60 antibody, 
and those containing significant pd2PBl and trace con- 
taminants are pooled. 
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Organic extraction — The pooled protein solution 
from the previous step is brought to a ratio of 55% 
acetonitrile to 45 % protein solution (v/v) by the slow 
addition of pure acetonitrile with mixing. After 
addition of all of the acetonitrile, the solution 
is centrifuged in a J2-21 centrifuge using a 
JA10 rotor (Beckman) at 10,000 rpm and A°C 
for 15 min. After centrifugation, the supernatant 
is collected and the pellet is discarded. 

The centrifugation supernatant is brought to a 
ratio of 35% ethanol to 65% supernatant (v/v) by slow 
addition of 95% ethanol with mixing. After addition 
of all of the ethanol, the solution is centrifuged 
in a J2-21 centrifuge using a JA-10 rotor at 10,000 rpm 
and 4°C for 15 min. After centrifugation the pellet is 
collected and the supernatant is discarded. 

The pellet is allowed to air dry for 15 min, and 
is then redissolved in S-30Q column buffer, which consists 
of 8 M urea, 0.3 M glycine, 5 mM EDTA, 15 mM 2-mercapto- 
ethanol, 1 mM dithiothreitol (DTT) (pH 8.50 ± 0.01). 
The pellet is dissolved in a volume of S-300 column 
buffer equal to one-tenth the volume of the pooled 
protein solution at the beginning of this step. 
Concentration - -The absorbance of the redissolved protein 
solution from above is determined at 280 nm and an 
approximate protein concentration is determined by 
assuming that a 1 mg/ml solution of protein has an 
absorbance of 1.0 at 280 nzn. The solution is concentrated 
to 10 mg/ml using a 200 ml Ami con stirred cell concen- 
trator with a YM-10 membrane. 
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S-300 chromatography — Thirty to seventy ml of the con- 
centrated protein solution is loaded on a 5.0 x 135 cm 
column of SEPRACRYL ® S-300 from Pharmacia. The column 
had been previously equilibrated with S-300 column 
buffer which consists of 8 M urea, 0.3 M glycine, 5 mM 
EDTA, 15 mM 2-mercaptoethanol, 1 mM DTT (pH 8.50 ± €.01). 
After loading, the column is run isocratically in the 
same buffer. Twenty ml fractions are collected and the 
fractions are assayed for pd2PBl content by SDS-PAGE. 

Equal volume aliquots are taken from suitable frac- 
tions containing pd2PBl and are used to determine which 
fractions are satisfactory for pooling. The aliquots 
are pooled, dialyzed overnight versus 8 M urea, 25 mM 
sodium phosphate, 1 mSA EDTA (pH 6.8 ± 0.1) , and the OD 
at 280 nm of the dialyzed pool is determined using the 
dialysis buffer as blank. The protein concentration 
of the solution is determined using the calculated 
extinction coefficient of pd2PBl of 1.0 (mg/ml)"^. 
SDS-PAGE is run on 10 pg of the dialyzed pooling using 
a 15% SDS acrylamide gel. After coomassie staining 
and destaining, the gel is scanned using an LKB 
(Gaithersburg, MD) scanning densitometer attached to 
a Waters (Milford, MA) 740 Integrator. If the 
pd2PBl band on the gel is more than 97% pure, then the 
fractions that were used for the a.liquot are checked 
for endotoxins at a 1 to 20 dilution • In the Limulus 
Amebocyte Lysate (IAL) assay using 0.06 eu/ml tubes. 
If the LAL test on the diluted fractions is negative, 
the fractions are pooled and used for subsequent opera- 
tions. If the gel fails to meet the purity specification 
the process is repeated using equal volume aliquots 
from a different set of " fractions . Only those fractions 
having a negative LAL test at a 1 to 20 dilution are 
pooled. 
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Table 1 

F 1 GATCAAGCTTCTGCAGTCGACGCATGCGGATCCGGTACCCGGGAGCTCG 3 1 
TTCGAAGACGTCAGCTGCGTACGCCTAGGCCATGGGCCCTCGAGCTTAA 



Table 2 



5 1 CGGTACCAGCCCGCCTAATGAGCGGGCTTTTTTTTGACGT 3 1 

TGCAGCCATGGTCGGGCGGATTACTCGCCCGAAAAAAAAC 



Table 3 



Mlul EcoRV Clal BamHI Sail Hindi II Smal 

CGAACGCGTGGCCGATATCATCGATGGATCCGTCGACAAGCTTCCCGGGAGCT 
GCTTGCGCACCGGCTATAGTAGCTACCTAGGCAGCTGTTCGAAGGGCCC 
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Table 4 



5 1 AATTCCCTGTGTGGAAGGAAGCA 
TTAAGGGACACACCTTCCTTCGT 

ACCACCACTCTATTTTGTGCATCAGATGCTAAAGCATATGATACAGAGGTACAT 
TGGTGGTGAGATAAAACACGTAGTCTACGATTTCGTATACTATGTCTCCATGTA 

AATGTTTGGGCCACACATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTA 
TTACAAACCCGGTGTGTACGGACACATGGGTGTCTGGGGTTGGGTGTTCTTCAT 

GTATTGGTAAATGTGACAGAAAATTTTAACATGTGGAAAAATGACATGGTAGAA 
CATAACCATTTACACTGTCTTTTAAAATTGTACACCTTTTTACTGTACCATCTT 

CAGATGCATGAGGATATAATCAGTTTATGGGATCAAAGCCTAAAGCCATGTGTA 
GTCTACGTACTCCTATATTAGTCAAATACCCTAGTTTCGGATTTCGGTACACAT 

AAATTAACCCCACTCTGTGTTAGTTTAAAGTGCACTGATTTGAAGAATGATACT 
TTTAATTGGGGTGAGACACAATCAAATTTCACGTGACTAAACXTCTTACTATGA 

AATACCAATAGTAGTAGCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAAC 
TTATGGTTATCATCATCGCCCTCTTACTATTACCTCTTTCCTCTCTATTTTTTG 

TGCTCTTTCAATATCAGCACAAGCATAAGAGGTAAGGTGCAGAAAGAATATGCA 
ACGAGAAAGTTATAGTCGTGTTCGTATTCTCCATTCCACGTCTTTCTTATACGT 

TTTTTTTATAAACTTGATATAATACCAATAGATAATGATACTACCAGCTATACG 
AAAAAAAT ATT^T G AACT ATATTAT GGTT ATCT ATT ACT AT GAT G GTC G AT ATGC 

TTGACAAGTTGTAACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCCTTT 
AACTGTTCAACATTGTGGAGTCAGTAATGTGTCCGGACAGGTTTCCATAGGAAA 

GAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGT 
CTCGGTTAAGGGTATGTAATAACACGGGGCCGACCAAAACGCTAAGATTTTACA 

AATAATAAGACGTTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAA 
TT ATT ATT CTGCAAGTT ACCTT GTCCT GGT ACAT GTTT A CA GTC GTGT CAT GTT 

TGTACACATGGAATTAGGCCAGTAGTATCAACTCAACTGCTGTTAAATGGCAGT 
ACATGTGTACCTTAATCCGGTCATCATAGTTGAGTTGACGACAATTTACCGTCA 

CTAGCAGAAGAAGAGGTAGTAATTAGATCTGCCAATTTCACAGACAATGCTAAA 
GATCGTCTTCTTCTCCATCATTAATCTAGACGGTTAAAGTGTCTGTTACGATTT 

ACCATAATAGTACAGCTGAACCAATCTGTAGAAATTAATTGTACAAGACCCAAC 
TGGTATTATCATGTCGACTTGGTTAGACATCTTTAATTAACATGTTCTGGGTTG 

AACAATACAAGAAAAAGTATCCGTATCCAGAGAGGACCAGGGAGAGCATTTGTT 
TTGTTATGTTCTTTTTCATAGGCATAGGTCTCTCCTAATCCCTCT2GTAAACAA 
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Table 4 (cont . ) 



A.CAATAGGAAiU\ATAGGAAATATGAGACAAGCACATTGTAACATTAGTAGAGCA 
TGTTATCCTTTTTATCCTTTATACTCTGTTCGTGTAACATTGTAATCATCTCGT 

AAATGGAATAACACTTTAAAACAGATAGATAGCAAATTAAGAGAACAATTTGGA 
TTTACCTTATTGTGAAATTTTGTCTATCTATCGTTTAATTCTCTTGTTAAACCT 

AATAATAAAACAATAATCTTTAAGCAGTCCTCAGGAGGGGACCCAGAAATTGTA 
TTATTATTTTGTTATTAGAAATTCGTCAGGAGTCCTCCCCTGGGTCTTTAACAT 

ACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTG 
TGCGTGTCAAAATTAACACCTCCCCTTAAAAAGATGACATTAAGTTGTGTTGAC 

TTTAATAGTACTTGGTTTAATAGTACTTGGAGTACTAAAGGGTCAAATAACACT 
AAATTATCATGAACCAAATTATCATGAACCTCATGATTTCCCAGTTTATTGTGA 

GAAGGAAGTGACACAATCACCCTCCCATGCAGAATAAAACAAATTATAAAGATG 
CTTCCTTCACTGTGTTAGTGGGAGGGTACGTCTTATTTTGTTTAATATTTGTAC 

TGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATCAGTGGACAAATTAGA 
ACCGTCCTTCATCCTTTTCGTTACATACGGGGAGGGTAGTCACCTGTTTAATCT 

TGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAGCAAC 
ACAAGTAGTTTATAATGTCCCGACGATAATTGTTCTCTACCACCATTATCGTTG 

AATGAGTCCGA 3 1 

TTACT GAG GCT CT AG 
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Table 5 



5 1 CTGAACCAATCTGTAGAAATTAATTGTACAAGACCCAAC 
GACTTGGTTAGACATCTTTAATTAACATGTTC7 "GGTTG 

AACAATACAAGAAAAAGTATCCGTATCCAGAGAGGACCAGGGAGAGCATTTGTT 
TTGTTATGTTCTTTTTCATAGGCATAGGTCTCTCCTGGTCCCTCTCGTAAAGAA 

ACAATAGGAAAAATAGGAAATATGAGACAAGCACATTGTAACATTAGTAGAGCA 
TGTTATCCTTTTTATCCTTTATACTCTGTTCGTGTAAGATTGTAATCATCTCGT 

AAATGGAATAACACTTTAAAACAGATAGATAGCAAATTAAGAGAACAATTTGGA 
TTTACCTTATTGTGAAATTTTGTCTATCTATCGTTTAATTCTCTTGTTAAACCT 

AATAATAAAACAATAATCTTTAAGCAGTCCTCAGGAGGGGACCCAGAAATTGTA 
TTATTATTTTGTTATTAGAAATTCGTCAGGAGTCCTCCCCTGGGTCTTTAACAT 

ACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTG 
TGCGTGTCAAAATTAACACCTCCCCTTAAAAAGATGACATTAAGTTGTGTTGAC 

TTTAATAGTACTTGGTTTAATAGTACTTGGAGTACTAAAGGGTCAAATAACACT 
AAATTATCATGAACCAAATTATCATGAACCTCATGATTTCCCAGTTTATTGTGA 

GAAGGAAGTGACACAATCACGCTCCCATGCAGAATAAAACAAATTATAAAGATG 
CTTCCTTCACTGTGTTAGTGGGAGGGTACGTCTTATTTTGTTTAATATTTGTAC 

TGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATCAGTGGACAAATTAGA 
ACCGTCCTTCATCCTTTTCGTTACATACGGGGAGGGTAGTCACCTGTTTAATCT 

TGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAGCAAC 
ACAAGTAGTTTATAATGTCCCGACGATAATTGTTCTCTACCACCATTATCGTTG 

AATGAGTCCGA 3 1 

TTACTCAGGCTCTAG 
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Table 6 



5 1 CTGAACCAATCTGTAGAAATTAATTGTACAAGACCCAAC 
GACTTGGTTAGACATCTTTAATTAACATGTTCTGGGTTG 

AACAATACAAGAAAAAGTATCCGTATCCAGAGAGGACCAGGGAGAGCATTTGTT 
TTGTTATGTTCTTTTTCATAGGCATAGGTCTCTCCTGGTCCCTCTCGTAAACAA 

ACAATAGGAAAAATAGGAAATATGAGACAAGCACATTGTAACATTAGTAGAGCA 
TGTTATCCTTTTTATCCTTTATACTCTGTTCGTGTAAGATTGTAATCATCTCGT 

AAATGGAATAACACTTTAAAACAGATAGATAGCAAATTAAGAGAACAATTTGGA 
TTTACCTTATTGTGAAATTTTGTCTATCTATCGTTTAATTCTCTTGTTAAACCT 

AATAATAAAACAATAATCTTTAAGCAGTCCTCAGGAGGGGACCCAGAAATTGTA 
TTATTATTTT13TTATTAGAAATTCGTCAGGAGTCGTCCCCTGGGTCTTTAACAT 

ACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTG 
TGCGTGTCAAAATTAACACCTCCCCTTAAAAAGATGACATTAAGTTGTGTTGAC 

TTTAATAGTACTTGGTTTAATAGTACTTGGAGTACTAAAGGGTCAAATAACAGT 
AAATTATCATGAACCAAATTATCATGAACCTCATGATTTCCCAGTTTATTGTGA 

GAAGGAAGTGACACAATCACCCTCCCATGCAGAATAAAACAAATTATAAACATG 
CTTCCTTCACTGTGTTAGTGGGAGGGTACGTCTTATTTTGTTTAATATTTGTAC 

TGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATCAGTGGACAAATTAGA 
ACCGTCCTTCAT(pCTTTTCGTTACATACGGGGAGGGTAGTCACCTGTTTAATCT 

TGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAGCAAC 
ACAAGTAGTTTATAATGTCCCGACGATAATTGTTCTCTACCACCATTATCGTTG 

AATGAGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGA 
TTACTCAGGCTCTAGAAGTCTGGACCTCCTCCTCTATACTCCCTGTTAACCTCT 

AGTGAATTATATAAATATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCC 
TCACTTAATATATTTATATTTCATCATTTTTAACTTGGTAATCCTCATCGTGGG 

ACCAAGGCAAAGAGAAGAGTGGTGCAGAGAGAAAAAAGAGCAGTGGGAATAGGA 
TGGTTCCGTTTCTCTTCTCACCACGTCTCTCTTTTTTCTCGTCACCCTTATCCT 

GCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGCGCAGCGTCA 
CGAAACAAGGAACCCAAGAACCCTCGTCGTCCTTCGTGATACCCGCGTCGCAGT 

ATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAG 
TACTGCGACTGCCATGTCCGGTCTGTTAATAACAGACCATATCACGTCGTCGTC 

AACAATTTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTC 
TTGTTAAACGACTCCCGATAACTCCGCGTTGTCGTAGACAACGTTGAGTGTCAG 

TGGGGCATCAAGCAGCTCCAGGCAAGAATCCTGGCTGTGGAAAGATACCTAAAG 
ACCCCGTAGTTCGTCGAGGTCCGTTCTTAGGACCGACACCTTTCTATGGATTTC 
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Table 6 (cont.) 



GATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGAAAACTCATTTGCACCACT 
CTAGTTGTCGAGGACCCCTAAACCCCAACGAGACCTTTTGAGTAAACGTGGTGA 

GCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGGAAT 
CGACACGGAACCTTACGATCAACCTCATTATTTAGAGACCTTGTCTAAACCTTA 

AACATGACCTGGATGGAGTGGGACAGAGAAATTAACAATTACACA 3 1 

TTGTACTGGACCTACCTCACCCTGTCTCTTTAATTGTTAATGTGTTCGA 
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Table 7 

5 • AATTCCCTGTGTGGAAGGAAGCA 
TTAAGGGACACACCTTCCTTCGT 

ACCACCACTCTATTTTGTGCATCAGATGCTAAAGCATATGATACAGAGGTACAT 
TGGTGGTGAGATAAAACACGTAGTCTACGATTTCGTATACTATGTCTCCATGTA 

AATGTTTGGGCCACACATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTA 
TTACAAACCCGGTGTGTACGGACACATGGGTGTCTGGGGTTGGGTGTTCTTCAT 

GTATTGGTAAATGTGACAGAAAATTTTAACATGTGGAAAAATGACATGGTAGAA 
CATAACCATTTACACTGTCTTTTAAAATTGTACACCTTTTTACTGTACCATCTT 

CAGATGCATGAGGATATAATCAGTTTATGGGATCAAAGCCTAAAGCCATGTGTA 
GTCTACGTACTCCTATATTAGTCAAATACCCTAGTTTCGGATTTCGGTACACAT 

AAATTAACCCCACTCTGTGTTAGTTTAAAGTGCACTGATTTGAAGAATGATACT 
TTTAATTGGGGTGAGACACAATCAAATTTCACGTGACTAAACTTCTTACTATGA 

AATACCAATAGTAGTAGCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAAC 
TTATGGTTATCATCATCGCCCTCTTACTATTACCTCTTTCCTCTCTATTTTTTG 

T G CT CTTT C AAT ATC AGC AC AAGC AT A AG AGGT AAGG TGC AG AAAGAATAT GC A 
ACGAGAAAGTTATAGTCGTGTTCGTATTCTCCATTCCACGTCTTTCTTATACGT 

TTTTTTTATAAACTTGATATAATACCAATAGATAATGATACTACCAGCTATACG 
? - JVAAAAT ATTT G AA CT ATATT AT G GTT ATCT ATT ACT AT GAT G GT CG AT AT G C 
« 

TTGACAAGTTGTAACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCCTTT 
AACTGTTCAACATTGTGGAGTCAGTAATGTGTCCGGACAGGTTTCCATAGGAAA 

GAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGT 
CTCGGTTAAGGGTATGTAATAACACGGGGCCGACCAAAACGCTAAGATTTTACA 

AATAATAAGACGTTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAA 
TTATTATTCTGCAAGTTACCTTGTCCTGGTACATGTTTACAGTCGTGTCATGTT 

TGTACACATGGAATTAGGCCAGTAGTATCAACTCAACTGCTGTTAAATGGCAGT 
A CAT GT GTA CCTT AAT C CGGTC AT CAT A GTT GAGTT G ACG AC AATTT AC CGTC A 

CTAGCAGAAGAAGAGGTAGTAATTAGATCTGCCAATTTCACAGACAATGCTAAA 
GATCGTCTTCTTCTCCATCATTAATCTAGACGGTTAAAGTGTCTGTTACGATTT 

ACCATAATAGTACAGCTGAACCAATCTGTAGAAATTAATTGTACAAGACCCAAC 
TGGTATTATCATGTCGACTTGGTTAGACATCTTTAATTAACATGTTCTGGGTTG 

AACAATACAAGAAAAAGTATCCGTATCCAGAGAGGACCAGGGAGAGCATTTGTT 
TTGTTATGTTCTTTTTCATAGGCATAGGTCTCTCCTAATCCCTCTCGTAAACAA 

ACAATAGGAAAAATAGGAAATATGAGACAAGCACATTGTAACATTAGTAGAGCA 
TGTTATCCTTTTTATCCTTTATACTCTGTTCGTGTAACATTGTAATCATCTCGT 
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Table 7 (cont.) 



AAATGGAATAACACTTTAAAACAGATAGATAGCAAATTAAGAGAACAATTTGGA 
TTTACCTTATTGTGAAATTTTGTCTATCTATCGTTTAATTCTCTTGTTAAACCT 

AATAATAAAACAATAATCTTTAAGCAGTCCTCAGGAGGGGACCCAGAAATTGTA 
TTATTATTTTGTTATTAGAAATTCGTCAGGAGTCCTCCCCTGGGTCTTTAACAT 

ACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTG 

tgcgtgtcaaaattaacacctccccttaaaaagAtgacattaagttgtgtxgac 

TTTAATAGTACTTGGTTTAATAGTACTTGGAGTACTAAAGGGTCAAATAACACT 
AAATTATCATGAACCAAATTATCATGAACCTCATGATTTCCCAGTTTATTGTGA 

GAAGGAAGTGACACAATCACCCTCCCATGCAGAATAAAACAAATTATAAACATG 
CTTCCTTCACTGTGTTAGTGGGAGGGTACGTCTTATTTTGTTTAATATTTGTAC 

TGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATCAGTGGACAAATTAGA 
ACCGTCCTTCATCCTTTTCGTTACATACGGGGAGGGTAGTCACCTGTTTAATCT 

TGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAGCAAC 
ACAAGTAGTTTATAATGTCCCGACGATAATTGTTCTCTACCACCATTATCGTTG 

AATGAGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGA 
TTACTCAGGCTCTAGAAGTCTGGACCTCCTCCTCTATACTCCCTGTTAACCTCT 

A GT G AATT AT AT AAAT AT AA AGT AGT AAAAATT GAACCAT T AGG AGT AGC AC CC 
TCACTTAATATATTTATATTTCATCATTTTTAACTTGGTAATCCTCATCGTGGG 

ACCAAGGCAAAGAGAAGAGTGGTGCAGAGAGAAAAAAGAGCAGTGGGAATAGGA 
TGGTTCCGTTTCTCTTCTCACCACGTCTCTCTTTTTTCTCGTCACCCTTATCCT 

GCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGCGCAGCGTCA 
CGAAACAAGGAACCCAAGAACCCTCGTCGTCCTTCGTGATACCCGCGTCGCAGT 

ATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAG 
TACTGCGACTGCCATGTCCGGTCTGTTAATAACAGACCATATCACGTCGTCGTC 

AACAATTTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTC 
TTGTTAAACGACTCCCGATAACTCCGCGTTGTCGTAGACAACGTTGAGTGTCAG 

' TGGGGCATCAAGCAGCTCCAGGCAAGAATCCTGGCTGTGGAAAGATACCTAAAG 
ACCCCGTAGTTCGTCGAGGTCCGTTCTTAGGACCGACACCTTTCTATGGATTTC 

GATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGAAAACTCATTTGCACCACT 
CTAGTTGTCGAGGACCCCTAAACCCCAACGAGACCTTTTGAGTAAACGTGGTGA 

GCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGGAAT 
CGACACGGAACCTTACGATCAACCTCATTATTTAGAGACCTTGTCTAAACCTTA 

AACATGACCTGGATGGAGTGGOACAGAGAAATTAACAATTACACA 3 1 

TTGTACTGGACCTACCTCACCCTGTCTCTTT.vaTTGTTAATGTGTTCGA 
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Table 8 

Amino acid sequence of fusion procein RIO 

MetLeuArg 

ProValGluTh'rProThrArgGlulleLysLysLeuAspGlyLeuTrpAlaPhe 

SerLeuAspArgGluAsnCysGlylleAspGlhPheProValTrpLysGIuAla 

ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHis 

AsnValTrpAlaThrHisAlaCysValProThrAspProAsnProGlnGluVal 

ValLeuValAsnValThrGluAsnPheAsnMetTrpLysAsnAspMetValGlu 

GInMetEisGluAspIlelleSerLeuTrpAspGlnSerLeuLysProCysVal 

LysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThr 

AsnThrAsnSerSe rSerGlyArgMetlleMetGluLysGlyGluIleLysAsn 

CysSerPheAsnlleSerThrSerlleArgGlyLysValGlnLysGluTyrAla 

PhePheTyrLysLeuAspIlelleProIIeAspAsnAspThrThrSerTyrThr 

LeuThr Se rCy s AsnThr Se rVal IleTh rGlnAlaCysP roLys ValSe rPhe 

GluProIleProIleEisTyrCysAlaProAlaGiyPheAlalleLeuLysCys 

AsnAsnLysThr?heAsnGiyThrGly?roCysThrA.snValSerThrValGln 

CysThrHisGlylleArgProValValSerThrGlnLeuLeuLeuAsnGlySer 

LeuAlaGluGluGiuValVallleArgSerAlaAsnPheThrAspAsnAlaLys 

ThrllelleValGInLeuAsnGinSerValGluIleAsnCysThrArgProAsn 

AsnAsnThrArgLysSerlieArglleGlnArgGIyProGlyArgAIaPheVal 

ThrlleGlyLysIieGlyAsnMefcArcGlnAlaHisCysAsnlleSerArgAla 

LysTrpAsnAsnThrLeuLysGinlleAspSerLysLeoArgGluGlnPheGly 

AsnAsnLysThrllellePheLysGlnSerSerGlyGIyAspProGiuIleVal 

ThrHisSerPheAsnCysGlyGiyGiuPhePheTyrCysAsnSerThrGlnLeu 

PheAsnSerThrTrpPheAsnSerThrTrpSerThrLysGiySerAsnAsnThr 

GluGlySerAspThrlleThrLeuProCysArglleLysGinllelleAsnMet 

TrpGInGiuValGlyLysAlaMetTyrAlaPrcProIIeSerGlyGlnlleArc 

CysSerSerAsnlleThrGiyLeuLeuLeuThrArgAspGiyGiyAsnSerAsp. 
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Table 8 (cone.) 
AsnGluSerGlalleHisArgSerValMetLeuTyrThrThrProAsnThrTrp 
Val AspAsp I leTh r Val ValTh r H is Val AlaGln AspCy s AsnHis Al aSe r 
ValAspTrpGlaValValAlaAsnGlyAspValSerValGluLeuArgAspAla 
AspGlnGlnValValAlaThrGlyGlnGlyThrSerGlyThrLeuGlnValVal 
AsnProHisLeaTcpGlnProGLyGluGlyTyrLeuTyrGluIxeuCysValThr 
AlaLysSerGlnThrGluCysAspIleTyrProLeuArgVaiGlylleArgSec 
ValAlaValLysGlyGluGlnPheLeuIleAsnHisLysProPheTyrPheThr 
GlyPheGIyArgHisGIuAspAiaAspLeuArgGlyLysGlyPheAspAjsnVal 
LeuMetValRisAspHisAlaLeviMetAspTrpIleGIyAIaAsnSerTyrArc 
ThrSerHisTyrProTyrAIaGluGluHetLeuAspTrpAlaAspGluHisGIy 
IleValVallleAspGIuThrAIaAIaValGIyPheAsnLeuSerLeuGlylle 
GlyPheGluAlaGlyAsnLysProLysGluLeuTyrSerGIuGloAlaVaiAsn 
GiyGluThrGlnGlnAlaEisLeuGlrxAlalleLysGluLeulIeAlaArgAsp 
LysAsnHisProSerValValMetTrpSerlleAlaAsnGluProAspThrArg 
ProGlnGIyAIaArgGluTyrPheAlaProLeuAlaGluAIaThrArcLysLeu 
AspP r oTh r A r gp ro 1 1 eTh r Cy s Va I As nValKe tpneCy s Asp AlaE isTh r 
AspThrlleSerAspLeuPheAspValLeuCysLeuAsnArgTyrTyrGlyTrp 
TyrValGlnSerGlyAspLeuGluThrAIaGluLysValLeuGluLysGluLeu 
LeuAIaTrpGlnGluLysLeuHisGInProIlellelleThrGIuTyrGlyVal 
AspThrLeuAlaGlyLeaHisSerM=tTyrThrAspMetTrpSerGluGluTyr 
GInCysAlaTrpLeuAspMetTyrEisArgValPheAspArgValSerAlaVal 
ValGlyGluGlnValTrpAsnPheAlaAspPheAlaThrSerGlnGlylleLea 
ArgValGIyGIyAsnLysLysGlyllePheThrArgAspArgLysProLysSer 
AlaAlaPhel,euLeuGlni»ysArgTrpThrGlyMetAsiiPheGiyGluiysPro 
GlnGlnGiyGlyLysGln 
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Table 8A 

Nucleotide sequence encoding fusion protein RIO 



ATGTTACGT 
TACAATGCA 

CCTGTAGAAACCCCAACCCGTGAAATCAAAAAACTCGACGGCCTGTGGGCATTC 
GGACATCTTTGGGGTTGGGCACTTTAGTTTTTTGAGCTGCCGGACACCCGTAAG 

AGTCTGGATCGCGAAAACTGTGGAATTGATCAATTCCCTGTGTGGAAGGAAGCA 
TCAGACCTAGCGCTTTTGACACCTTAACTAGTTAAGGGACA<^CCTTCCTTCGT 

ACCACCACTCTATTTTGTGCATCAGATGCTAAAGCATATGATACAGAGGTACAT 
TGGTGGTGAGATAAAACACGTAGTCTACGATTTCGTATACTATGTCTCCA^GTA 

AATGTTTGGGCCACACATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTA 
TTACAAACCCGGTGTGTACGGACACATGGGTGTCTGGGGTTGGGTGTTCTTCAT 

GTATTGGTAAATGTGACAGAAAATTTTAACATGTGGAAAAATGACATGGTAGAA 
CATAACCATTTACACTGTCTTTTAAAATTGTACACCTTTTTACTGTACCATCTT 

CAGATGCATGAGGATATAATCAGTTTATGGGATCAAAGCCTAAAGCCATGTGTA 
GT CT ACGTACTCCTAT ATT AGTCAAAT AC C CTAGTTTCGG ATTTCGGTACACAT 

AAATTAAC CCCACTCTGTGTT AGTTT AAAGTGC ACTGATTTGAAGAATGAT ACT 
TTTAATTGGGGTGAGACACAATCAAATTTCACGTGACTAAACTTCTTACTATGA 

AATACCAATAGTAGTAGCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAAC 
TTATGGTTATCATCATCGCCCTCTTACTATTACCTCTTTCCTCTCTATTTTTTG 

TGCTCTTTCAATATCAGCACAAGCATAAGAGGTAAGGTGCAGAAAGAATATGCA 
ACGAGAAAGTTATAGTCGTGTTCGTATTCTCCATTQZACGTCTTTCTTATACGT 

TTTTTTTATAAACTTGATATAATACCAATAGATAATGATACTACCAGCTATACG 
AAAAAAATATTTGAACTATATTATGGTTATCTATTACTATGATGGTCGATATGC 

T TGAC AAGTTGTAACACCTC AGTCATTAC AC AGGCCTGTC C^-AAGGTATCCTTT 
AACTGTTCAACATTGTGGAGTCAGTAATGTGTCCGGACAGGTTTCC^ 

GAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGT 
CT C G GTT AAG G GTATGTAATAAC AC GGGGC C G ACC^JIACGCT AAG ATTTTA CA 

AATAATAAGACGTTCAATGGAACAGGACCATGTACAA^ 

TTATTATTCTGCAAGTTACCTTGTCCTGGTACATGTTTACAGTCGTGTCATGTT 

TGTACACATGGAATTAGGCCAGTAGTATCAACTCAACTGCTGTTAAATGGCAGT 
ACATGTGTACCTTAATCCGGTCATCATAGTTGAGTTGACGACAATTTACCGTCA 

CTAGCAGAAGAAGAGGTAGTAATTAGATCTGCGAATTTCACAGAC^^ 

GAT C GTCTTCTT CT CC AT CATT AATCT AG ACGGTT AAAGT GT CT GTTACG ATTT 

ACCATAATAGTACAGCTGAACCAATCTGTAGAAATTAATTGTACAAGACCCAAC 
TGGTATTATCATGTCGACTTGGTTAGACATCTTTAATTAACATGTTCTGGGTTG 
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AACAATACAAGAAAAAGTATCCGTATCCAGAGAGGACCAGGGAGAGO^TTTGTT 
TTGTTATGTTCTTTTTCATAGGCATAGGTCTCTCCTAATCCCTCTCGTAAACAA 

ACAATAGGAAAAATAGGAAATATGAGACAAGCACATTGTAACATTAGTAGAGCA 
TGTTATCCTT.TTTATCCTTTATACTCTGTTCGTGTAACATTGTAATCATCTCGT 

AAATGGAATAACACTTTAAAACAGATAGATAGCAAATTAAGAGAACAATTTGGA 
TTTACCTTATTGTGAAATTTTGTCTATCTATCGTTTAATTCTCTTGTTAAACCT 

AATAATAAAACAATAATCTTTAAGCAGTCCTCAGGAGGGGACCCAGAAATTGTA 
TTATTATTTTGTTATTAGAAATTCGTCAGGAGTCCTCCCCTGGGTCTTTAACAT 

ACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTG 
TGCGTGTCAAAATTAACACCTCCCCTTAAAAAGATGACATTAAGTTGTGTTGAC 

TTTAATAGTACTTGGTTTAATAGTACTTGGAGTACTAAAGGGTCAAATAACACT 
AAATT AT CAT G AACC AAATT AT C ATG AACCT CAT G ATTTC C CAGTTT ATT GTGA 

GAAGGAAGTGACACAATCACCCTCCCATGCAGAATAAAACAAATTATAAACATG 
CTT C CTT C ACTGTGTT AGTGGG AG GGT ACGTCT T ATTTTGTTT AAT ATTTGT AC 

TGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATCAGTGGACAAATTAGA 
ACCGTCCTTCATCCTTTTCGTTACATACGGGGA.GGGTAGTCACCTGTTTAATCT 

TGTTCATCAAATATTACAGGGCTGCTATTAAC^AGAGATGGTGGTAATAGCAAC 
ACAAGTAGTTTATAATGTCCCGACGATAATTGTTCTCTACCACCATTATCGTTG 

AATGAGTCCGAGATCCATCGCAGCGTAATGCTCTACACCACGCCGAACACCTGG 
TTACTCAGGCTCTAGGTAGCGTCGCATTACGAGATGTGGTGCGGCTTGTGGACC 

GTGGACGATATCACCGTGGTGACGCATGTCGCGCAAGACTGTAACCACGCGTCT 
CACCTGCTATAGTGGCACCACTGCGTACAGCGCGTTCTGACATTGGTGCGCAGA 

GTTGACTGGCAGGTGGTGGCCAATGGTGATGTCAGCGTTGAACTGCGTGATGCG 
CAACTGACCGTCCACCACCGGTTACCACTACAGTCGCAACTTGACGCACTACGC 

GATCAACAGGTGGTTGCAACTGGACAAGGCACTAGCGGGACTTTGCAAGTGGTG 
CTAGTTGTCCACCAACGTTGACCTGTTCCGTGATCGCCCTGAAACGTTCACCAC 

AATCCGCACCTCTGGCAACCGGGTGAAGGTTATCTCTATGAACTGTGCGTCACA 
TTAGGCGTGGAGACCGTTGGCCCACTTCCAATAGAGATACTTGACACGCAGTGT 

GCCAAAAGCCAGACAGAGTGTGATATCTACCCGCTTCGCGTCGGCATCCGGTCA 
CGGTTTTCGGTCTGTCTCACACTATAGATGGGCGAAGCGCAGCCGTAGGCCAGT 

GTGGCAGTGAAGGGCGAACAGTTCCTGATTAACCACAAACCGTTCTACTTTACT 
CACCGTCACTTCCCGCTTGTCAAGGACTAATTGGTGTTTGGCAAGATGAAATGA 

GGCTTTGGTCGTCATGAAGATGCGGACTTGCGTGGCAAAGGATTCGATAACGTG 
CCC-AAACCAGCAGCACTTCTACGCCTGAACGCACCGTTTCCTAAGCTATTGCAC 

CTGATGGTGCACGACCACGCATTAATGGACTGGATTGGGGCCAACTCCTACCGT 
GACTACCACGTGCTGGTGCGTAATTACCTGACCTAACCCCGGTTGAGGATGGCA 
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Table 8A (cone.) 



ACCTCGCATTACCCTTACGCTGAAGAGATGCTCGACTGGGCAGATGAACATGGC 
TGGAGCGTAATGGGAATGCGACTTCTCTACGAGCTGACCCGTCTACTTGTACCG 

ATCGTGGTGATTGATGAAACTGCTGCTGTCGGCTTTAACCTCTCTTTAGGCATT 
TAGCACCACTAACTACTTTGACGACGACAGCCGAAATTGGAGAGAAATCCGTAA 

GGTTTCGAAGCGGGCAACAAGCCGAAAGAACTGTACAGCGAAGAGGCAGTCAAC 
CCAAAGCTTCGCCCGTTGTTCGGCTTTCTTGACATGTCGCTTCTCCGTCAGTTG 

GGGGAAACTCAGCAAGCGCACTTACAGGCGATTAAAGAGCTGATAGCGCGTGAC 
CCCCTTTGAGTCGTTCGCGTGAATGTCCGCTAATTTCTCGACTATCGCGCACTG 

AAAAACCACCCAAGCGTGGTGATGTGGAGTATTGCCAACGAACCGGATACCCGT 
TTTTTGGTGGGTTCGCACCACTACACCTCATAACGGTTGCTTGGCCTATGGGCA 

CCGCAAGGTGCACGGGAATATTTCGCGCCACTGGCGGAAGCAACGCGTAAACTC 
GGCGTTCCACGTGCCCTTATAAAGCGCGGTGACCGCCTTCGTTGCGCATTTGAG 

GACCCGACGCGTCCGATCACCTGCGTCAATGTAATGTTCTGCGACGCTCACACC 
CTGGGCTGCGCAGGCTAGTGGACGCAGTTACATTACAAGACGCTGCGAGTGTGG 

GATACCATCAGCGATCTCTTTGATGTGCTGTGCCTGAACCGTTATTACGGATGG 
CTATGGTAGTCGCTAGAGAAACTACACGACACGGACTTGGCAATAATGCCTACC 

TATGTCCAAAGCGGCGATTTGGAAACGGCAGAGAAGGTACTGGAAAAAGAACTT 
ATACAGGTTTCGCCGCTAAACCTTTGCCGTCTCTTCCATGACCTTTTTCTTGAA 

CTGGCCTGGCAGGAGAAACTGCATCAGCCGATTATCATCACCGAATACGGCGTG 
GACCGGACCGTCCTCTTTGACGTAGTCGGCTAATAGTAGTGGCTTATGCCGCAC 

GATACGTTAGCCGGGCTGCACTCAATGTACACCGACATGTGGAGTGAAGAGTAT 
CTATGCAATCGGCCCGACGTGAGTTACATGTGGCTGTACACCTCACTTCTCATA 

CAGTGTGCATGGCTGGATATGTATCACCGCGTCTTTGATCGCGTCAGCGCCGTC 
GTCACACGTACCGACCTATACATAGTGGCGCAGAAACTAGCGCAGTCGCGGCAG 

GTCGGTGAACAGGTATGGAATTTCGCGGATTTTGCGACCTCGCAAGGCATATTG 
CAGCCACTTGTCCATACCTTAAAGCGGCTAAAACGCTGGAGCGTTCCGTATAAC 

CGCGTTGGCGGTAACAAGAAAGGGATCTTCACTCGCGACCGCAAACCGAAGTCG 
GCGCAACCGCCATTGTTCTTTCCCTAGAAGTGAGCGCTGGCGTTTGGCTTCAGC 

GCGGCTTTTCTGCTGCAAAAACGCTGGACTGGCATGAACTTCGGTGAAAAACCG 
CGCCGAAAAGACGACGTTTTTGCGACCTGACCGTACTTGAAGCCACTTTTTGGC 

CAGCAGGGAGGCAAACAA 
GTCGTCCCTCCGTTTGTT 
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Table 9 

Amino acid sequence of fusion protein PBl 

MetLeuArg 

PsroValGiuThrProThrArgGluIleLysLysLeuAspGlyLeuTrpAlaPhe 
SerLeuAspArgGluArgValAlaAspLeuAsnGInSerValGluIleAsnCys 
ThrArgProAsnAsnAsnThrArgLysSerlleArglleGlnArgGlyProGly 
ArgAlaPheValThrlleGlyLysIleGlyAsnMetArgGlnAlaHisCysAsn 
IleSerArgAlaLysTrpAsnAsnThrLeuLysGlnlleAspSerLysLeuArg 
GluGlnPheGlyAsnAsnLysThrllellePheLysGlnSerSerGlyGlyAsp 
ProGluIleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsn 
SerThrGlnLeuPheAsnSerThrTrpPheAsnSerThrTrpSerThrLysGly 
SerAsnAsnThrGluGlySerAspThrlleThrLeuProCysArglleLysGln 
IlelleAsnMetTrpGlnGluValGlyLysAlaMetTyrAlaProProlleSer 
GlyGlnlleArgCysSerSecAsnlieThrGlyLeuLeuLeuThrArgAspGly 
GlyAsnSerAsnAsnGluSerGluIleArgArgGlnAlaSerArgGluLeuGIu 
PheLeuLysThrLysGlyProArgAspThrProIlePhelieGly v 
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Table 9A 

Nuclepcide sequence encoding fusion protein PBl 



ATGTTACGTCCTGTAGAAACCCCAACCCGTGAAATCAAAAAACTCGACGGCCTG 
TACAATGCAGGACATCTTTGGGGTTGGGCACTTTAGTTTTTTGAGCTGCCGGAC 

TGGGCATTCAGTCTGGATCGCGAACGCGTGGCCGATCTGAACCAATCTGTAGAA 
ACCCGTAAGTCAGACCTAGCGCTTGCGCACCGGCTAGACTTGGTTAGACATCTT 

ATTAATTGTACAAGACCCAACAACAATACAAGAAAAAGTATCCGTATCCAGAGA 
TAATTAACATGTTCTGGGTTGTTGTTATGTTCTTTTTCATAGGCATAGGTCTCT 

GGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGAAATATGAGACAAGCA 
CCTAATCCCTCTCGTAAACAATGTTATCCTTTTTATCCTTTATACTCTGTTCGT 

CATTGTAACATTAGTAGAGCAAAATGGAATAACACTTTAAAACAGATAGATAGC 
GTAACATTGTAATCATCTCGTTTTACCTTATTGTGAAATTTTGTCTATCTATCG 

AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAGTCCTCA 
TTTAATTCTCTTGTTAAACCTTTATTATTTTGTTATTAGAAATTCGTCAGGAGT 

GGAGGGGACCCAGAAATTGTAACGCAGAGTTTTAATTGTGGAGGGGAATTTTTC 
CCTCCCCTGGGTCTTTAACATTGCGTGTCAAAATTAACACCTCCCCTTAAAAAG 

TACTGTAATTCAACACAACTGTTTAATAGTACTTGGTTTAATAGTACTTGGAGT 
ATGACATTAAGTTGTGTTGACAAATTATCATGAACCAAATTATCATGAACCTCA 

ACTAAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACCCTCCCATGCAGA 
TGATTTCCCAGTTTATTGTGACTTCCTTCACTGTGTTAGTGGGAGGGTACGTCT 

ATAAAACAAATTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCT 
TATTTTGTTTAATATTTGTACACCGTCCTTCATCCTTTTCGTTACATACGGGGA 

CCCATCAGTGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACA 
GGGTAGTCACCTGTTTAATCTACAAGTAGTTTATAATGTCCCGACGATAATTCT 

AGAGATGGTGGTAATAGCAACAATGAGTCCGAGATCCGTCGACAAGCTTCCCGG 
TCTCTACCACCATTATCGTTGTTACTCAGGCrCTAGGCAGCTGOTCGAAGGGCC 

GAGGXCGAATTCTTGAAGACGAAAGGGCCTCGTGATACTCCTATTTTTATAGGT 
CTCGAGCTTAAGAACTTCTGCTTTCCCGGAGCACTATGCGGATAAAAATATCCA 
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Table 10 

Amino acid sequence of fusion protein 590 

MetLeuArgProValGluThr 
PcoThrArgGlulleLysLysLeuAspGlyLeuTrpAlaPheSerLeuAspArg 
SluArgValAlaAspLeuAsnGlnSecValGluIleAsnCysThrArgProAsn 
AsnAsnThrArgLysSerlleArglleGlnArgGlyProGlyArgAlaPheVal 
ThrlleGlyLysIleGlyAsnMetArgGlnAlaHisCysAsnlleSerArgAla 
LysTrpAsnAsnThrLeuLysGlnlleAspSerLysLeuArgGluGlnPheGly 
AsnAsnLysThrllellePheLysGlnSerSerGlyGlyAspProGluIleVal 
ThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerTbrGlnLeu 
PheAsnSerThrTrpFheAsnSerThrTrpSerThrLysGlySerAsnAsnThr 
GluGIySerAspThrlleThrLeuProCysArglleLysGlnllelleAsnMet 
TrpGlnGluValGiyLysAlaMetTyrAlaProProileSerGlyGlnlleArg 
CysSer-SerAsnlleThrGiyLeuLeuLeuThrArgAspGlyGlyAsnSerAsn 
AsnGluSerGluIlePheArgProGlyGlyGlyAspMetArgAspAsnTrpArg 
SerGluLeuTyrLysTyrLysValValLysIleGluProLeuGlyValAlaPro 
ThrLysAlaLysArgArgValvalGlnArgGIuLysArgAlaValGlylleGly 
AlaLeuPheLeuGlyPheLeuGlyAleAlaGlySerThrMetGlyAlaAlaSer 
MetThrLeuThrValGlnAlaArgGlnLeuLeuSerGlylleValGlnGInGIn 
AsnAsnLeuLeuArgAlaIleGl«2AlaGinGInHisLeuLeuGInLeuThrVal 
TrpGlyilelxysGlnLeuGInAlaArglleLeuAlaValGluArgTyrLeuLys 
AspGlnGlnlieuLeuGlylieTrpGlyCysSerGIyl.ysLeuIleCysThrThr 
AlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGiuGlnlleTrpAsn 
AsnMetThrTrpMefcGluTrpAspArgGluIleAsnAsnTyrThrSerPhePro 
IleHisArgSerValMetLeuTyrThrThrProAsnThrTrpValAspAspIle 
ThrValValThrfiisValAlaGlnAspCysAsnHisAlaSerValAspTrpGln 
ValValAlaAsnGlyAspValSerValGIuLeuArgAspAIaAspGlnGlnVal 
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Table 10 (cont.) 

ValAlaThrGlyGlnGlyThrSetGlyThrLeaGlnValValAsnProHisLeu 

TrpGlnProGlyGluGlyTyrLeuTyrGluLeuCysValThrAlaLysSerGln 

ThrGluCysAspIieTyrProLeuArgValGlylleArgSerValAlaValLys 

GlyGluGlnPheLeuIleAsnKisLysProPheTyrPheThrGlyPheGlyArg 

HisGluAspAlaAspLeuArgGlyLysGlyPheAspAsnValLeuMetValHis 

AspHisAlaLeuMetAspTrpIleGlyAlaAsnSerTyrArgThrSecHisTyr 

ProTyrAlaGluGluMetLeuAspTrpAlaAspGluHisGlylleValVallle 

AspGluThrAlaAlaValGlyPheAsnLeuSerLeuGlylleGlyPheGluAla 

GlyAsnLysProLysGluLeuTyrSerGluGluAlaValAsnGlyGluThrGIn 

GInAlaHisLeuGlnAlalleLysGlaLeuIleAlaArgAspLysAsnKisPro 

SerValValMetTrpSerXleAlaAsnGluProAspThrArgPrcGlnGlyAla 

ArgGluTyrPheAlaProLeuAlaGlaAlaThrArgLysLeuAspProThrArg 
i 

ProIleThrCysValAsnValMetPheCysAspAlaBisThrAspThrlleSer 
AspLeuPheAspValLeuCysLeuAsnArgTyrTyrGIyTrpTyrValGlnSer 
GlyAspLeuGiuThrAlaGluLysValLeuGluLysGluLeuLeuAlaTrpGIn 
GluLysLeuHisGlnProIlellelleThrGluTyrGlyValAspThrLeuAIa 
GlyLeuHisSerMetTyrThrAspMetTrpSerGluGluTyrGlnCysAlaTrp 
LeuAspMetTyrHisArgValPheAspArgValSerAiaValValGlyGluGIn 
ValTrp^snPheAlaAspPheAlaThrSerGlnGlylleLeuArgValGIyGIy 
AsnLysLysGlyIlePheThrArgAspArgLysPrcLysS2rAlaAlaPhel,eu 
LeuGlnLysArgTrpThrGiyMetAsnPheGlyGIuLysProGlnGlnGlyGly 
LysGln 
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Table 10A 

Nucleotide sequence encoding fusion protein 590 



ATGTTACGTCCTGTAGAAACC 
T AC AAT G C AGG ACAT CTTTG G 

CC.^ACCCGTGAAATCAAAAAACTCGACGGCCTGTGGGCATTCAGTCTGGATCGC 
GGTTGGGCACTTTAGTTTTTTGAGATGCCGGACACCCGTAAGTCAGACCTAGCG 

GAACGCGTGGCCGATCTGAACCAATCTGTAGAAATTAATTGTACAAGACCCAAC 
CTTGCGCACCGGCTAGACTTGGTTAGACATCTTTAATTAACATGTTCTGGGTTG 

AACAATACAAGAAAAAGTATCCGTATCCAGAGAGGACCAGGGAGAGCATTTGTT 
TTGTTATGTTCTTTTTCATAGGCATAGGTCTCTCCTAATCCCTCTCGTAAACAA 

ACAATAGGAAAAATAGGAAATATGAGACAAGCACATTGTAACATTAGTAGAGCA 
TGTTATCCTTTTTATCCTTTATACTCTGTTCGTGTAACATTGTAATCATCTCGT 

AAATGGAATAACACTTTAAAACAGATAGATAGCAAATTAAGAGAACAATTTGGA 
TTTACCTTATTGTGAAATTTTGTCTATCTATCGTTTAATTCTCTTGTTAAACCT 

AATAATAAAACAATAATCTTTAAGCAGTCCTCAGGAGGGGACCCAGAAATTGTA 
TTATTATTTTGTTATTAGAAATTCGTCAGGAGTCCTCCCCTGGGTCTTTAACAT 

ACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTG 
TGCGTGTCAAAATTAACACCTCCCCTTAAAAAGATGACATTAAGTTGTGTTGAC 

TTTAATAGTACTTGGTTTAATAGTACTTGGAGTACTAAAGGGTCAAATAACACT 
AAATTATCATGAACCAAATTATCATGAACCTCATGATTTCCCAGTTTATTGTGA 

GAAGGAAGTGACACAATCACCCTCCCATGCAGAATAAAACAAATTATAAACATG 
CTTCCTTCACTGTGTTAGTGGGAGGGTACGTCTTATTTTGTTTAATATTTGTAC 

TGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATCAGTGGACAAATTAGA 
ACCGTCCTTCATCCTTTTCGTTACATACGGGGAGGGTAGTCACCTGTTTAATCT 

TGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAGCAAC 
ACAAGTAGTTTATAATGTCCCGACGATAATTGTTCTCTACCACCATTATCGTTG 

AATGAGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGA 
TTACTCAGGCTCTAGAAGTCTGGACCTCCTCCTCTATACTCCCTCTTAACCTCT 

AGTGAATTATATAAATATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCC 
TCACTTAATATATTTATATTTCATCATTTTTAACTTGGTAATCCTCATCGTGGG 

ACCAAGGCAAAGAGAAGAGTGGTGCAGAGAGAAAAAAGAGCAGTGGGAATAGGA 
TGGTTCCGTTTCTCTTCTCACCACGTCTCTCTTTTTTCTCGTCACCCTTATCCT 
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Table 10A (cont.) 



GCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGCGCAGCGTCA 
CGAAACAAGGAACCCAAGAACCCTCGTCGTCCTTCGTGATACCCGCGTCGCAGT 

ATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAG 
TACTGCGACTGCCATGTCCGGTCTGTTAATAACAGACCATATCACGTCGTCGTC 

AACAATTTGCTGAGGGCTATTGAGGGGCAACAGCATCTGTTGCAACTCACAGTC 
TTGTTAAACGACTCCCGATAACTCCGCGTTGTCGTAGACAACGTTGAGTGTCAG 

TGGGGCATCAAGCAGCTCCAGGCAAGAATCCTGGCTGTGGAAAGATACCTAAAG 
ACCCCGTAGTTCGTCGAGGTCCGTTCTTAGGACCGACACCTTTCTATGGATTTC 

GATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGAAAACTCATTTGCACCACT 
CTAGTTGTCGAGGACCCCTAAACCCGAACGAGACCTTTTGAGTAAACGTGGTGA 

GCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGGAAT 
CGACACGGAACCTTACGATCAACCTCATTATTTAGAGACCTTGTCTAAACCTTA 

AACATGACCTGGATGG.AGTGGGACAGAGAAATTAACAATTACACAAGCTTCCCG 
TTGTACTGGACCTACCTCACCCTGTCTCTTTAATTGTTAATGTGTTCGAAGGGC 

ATCCATCGCAGCGT^ATGCTCTACACCACGCCGAACACCTGGGTGGACGATATC 
TAGGTAGCGTCGCATTACGAGATGTGGTGCGGCTTGTGGAGCCACCTGCTATAG 

ACCGTGGTGACGCATGTCGCGCAAGACTGTAACCACGCGTCTGTTGACTGGCAG 
TGGCACCACTGCGTACAGCGCGTTCTGACATTGGTGCGCAGACAACTGACCGTC 

GTGGTGGCCAATGGTGATGTCAGCGTTGAACTGCGTGATGCGGATCAACAGGTG 
CACCACCGGTTACCACTACAGTCGCAACTTGACGCACTACGCCTAGTTGTCCAC 

GTTGCAACTGGACAAGGCACTAGCGGGACTTTGCAAGTGGTGAATCCGCACCTC 
CAACGTTGACCTGTTCCGTGATCGCCCTGAAACGTTCACCACTTAGGCGTGGAG 

TGGCAACCGGGTGAAGGTTATCTCTATGAACTGTGCGTCACAGCCAAAAGCCAG 
ACCGTTGGCCCACTTCCAATAGAGATACTTGACACGCAGTGTCGGTTTTCGGTC 

ACAGAGTGTGATATCTACCCGCTTCGCGTCGGCATCCGGTCAGTGGCAGTGAAG 
TGTCTCACACTATAGATGGGCGAAGCGCAGCCGTAGGCCAGTCACCGTCACT7C 

GGCGAACAGTTCCTGATTAACCACAAACCGTTCTACTTTACTGGCTTTGGTCGT 
CCGCTTGTCAAGGACTAATTGGTGTTTGGCAAGATGAAATGACCGAAACCAGCA 

CATGAAGATGCGGACTTGCGTGGCAAAGGAXTCG^TAACGTGCTGATGGTGCAC 
GCACTTCTACGCCTGAACGCACCGTTTCCTAA^CTATTGCACGACTACCACGTG 

GACCACGCATTAATGGACTGGATTGGGGCCAACTCCTACCGTACCTCGCATTAC 
CTGGTGCGTAATTACCTGACCTAACCCCGGTTGAGGATGGCATGGAGCGTAATG 
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Table LOA (cone J. 



CCTTACGCTGAAGAGATGCTCGACTGGGCAGATGAACATGGCATCGTGGTGATT 
GGAATGCGACXTCTCTACGAGCTGACCCGTCTACTTGTACCGTAGCACCACTAA 

GATGAAACTGCTGCTGTCGGCTTTAACCTCTCTTTAGGCATTGGTTTCGAAGCG 
CTACTTTGACGACGACAGCCGAAATTGGAGAGAAATCCGTAACCAAAGCTTCGC 

GGCAACAAGCCGAAAGAACTGTACAGCGAAGAGGCAGTCAACGGGGAAACTCAG 
CCGTTGTTCGGCTTTCTTGACATGTCGCTTCTCCGTCAGTTGCCCCTTTGAGTC 

CAA.GCGCACTTA.CAGGCGATTAAAGAGCTGATAGCGCGTGACAAAAACCACCCA 
GTTCGCGTGAATGTCCGCTAATTTCTCGACTATCGCGCACTGTTTTTGGTGGGT 

AGCGTGGTGATGTGGAGTATTGCCAACGAACCGGATACCCGTCCGCAAGGTGCA 
TCGCACCACTACACCTCATAACGGTTGCTTGGCCTATGGGCAGGCGTTCCACGT 

CGGGAATATTTCGCGCCACTGGCGGAAGCAACGCGTAAACTCGACCCGACGCGT 
GCCCTTAXAAAGCGCGGTGACCGCCTTCGTTGCGCATTl'GAGCTGGGCTGCGCA 

CCGATCACCTGCGTCAATGTAATGTTCTGCGACGCTCACACCGATACCATCAGC 
GGCTAGTGGACGCAGTTACA.TTACAAGACGCTGCGAGTGTGGCTATGGTAGTCG 

GATCTCTTTGATGTGCTGTGCCTGAACCGTTATTACGGATGGTATGTCCAAAGC 
CTAGAGAAACTACACGACACGGACTTGGCAATAATGCCTACCATACAGGTTTCG 

GGCGATTTGGAAACGGCAGAGAAGGTACTGGAAAAAGAACTTCTGGCCTGGCAG 
CCGCTAAACCTTTGCCGTCTCTTCCATGACCTTTTTCTTGAAGACCGGACCGTC 

GAGAAACTGCATCAGCCGATTATCATCACCGAATACGGCGTGGATACGTThGCC 
CTCTTTGACGTAGTCGGCTAATAGTAGTGGCTTATGCCGCACCTATGCAATCGG 

GGGCTGCACTCAATGTACACCGACATGTGGAGTGAAGAGTATCAGTGTGCATGG 
CCCGACGTGAGTTACATGTGGCTGTACACCTCACTTCTCATAGTCACACGTACC 

CTGGATATGTATCACCGCGTCTTTGATCGCGTCAGCGCCGTCGTCGGTGAACAG 
GACCTATACATAGTGGCGCAGAAACTAGCGCAGTCGCGGCAGCAGCCACTTGTC 

GTATGGAATTTCGCCGATTTTGCGACCTCGCAAGGCATATTGCGCGTTGGCGGT 
CATACCTTAAAGCGGCTAAAACGCTGGAGCGTTCCGTATAACGCGCAACCGCCA 

AACAAGAAAGGGATCTTCACTCGCGACCGCAAACCGAAGTCGGCGGCTT1TCTG 
TTGTTCTTTCCCTAGAAGTGAGCGCTGGCGTTTGGCTTCAGCCGCCGAAAAGAC 

CTGCAAAAACGCTGGACTGGCATGAACTTCGGTGAAAAACCGCAGCAGGGAGGC 
GACGTTTTT GCG ACCTGACC GTA CTT G AAGCCACTTTTT GGCGTCGTCCCTCCG 



AAACAA 
TTTGTT 
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Table LI 

Amino acid sequence of fusion protein KR1 

MetLeuArg 

ProVaiGluThrProThrArgGlulleLysLysLeuAspGlyLeuTrpAlaPhe 
SerLeuAspArgGluArgGluPheProValTrpLysGluAlaThrThrThcLeu 

Ph eCy s Al a Se r AspAl aLy s Al aTy r AspTh rG 1 u valH is AsnValT rpAl a 

ThrHisAlaCysValProThrAspFrDAsnProGlnGluValValLeuVaiAsn 

ValThrGluAsnPheAsnMetTrpLysAsnAspMetValGluGlnMetHisGlu 

AspIlelleSerLeuTrpAspGinSerLeuLysProCysValLysLeuThrPro 

LeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThrAsnSer 

SerSerGlyArgMetlleMetGluLysGlyGluIieLysAsnCysSerPheAsn 

IleSerThrSerlleArgGlyLysValGlnLysGluTyrAlaPhePheTyrliys 

LeuAspIlelleProIleAspAsnAspThrThrSerTyrThrLeuThrSerCys 

AsnThrSerValXleThrGlnAlaCysProLysVaiSerPheGluProilePro 

IleHisTyrCysAlaProAlaGlyPheAialleLeuLysCysAsnAsnLysThr 

PheAsnGlyThcGlyProCysThrAsnVaiSerThrValGinCysThrHisGly 

IleArgProValVaiSerThcGinLeuLeuLeuAsnGlySerLeuAlaGlaGlu 

GluValVaill =ArgSerAlaAsn?heahrAspAsnAiaLysThrIieIieVal 

GlnLeuAsnGlnSerValGl^IleAsnCysThrArcProAsnAsnAsnThrArg 

LysSerlleArglieGlnArgGlyProGly/ rgAlaPheValThrlleGlyLys 

IleGlyAsnHetArgGlaAiaBisCysAsnlleSarArgAlaLysTrpAsnAsn 

ThrLeuLysGlnll^AspSerLysLeuArgGluGlnPheGlyAsnAsnLysThr 

IlellePheLysGlnScrSerGlyGlyAspProGluXleValThrBisSerPhe 

AsnCysGlyGlyGluPh^PheTyrCysAsnSerThrGlnLeuPheAsnSecThr 

TrpPheAsnSerThrTrpSer^hrLysGlySerAsnAsnThrGluGiySerAsp 

ThrlleThrLtruProCysArglleLysGinllelleAsnMefcTrpGlnGluVal 
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Table 11 (cone.) 

GlyLvsAlaMetTyrAlaProProIleSerGIyGlnlleArgCysSerSerAsn 
lieThrGlyLeaLeuLeuThrArcAspGlyGlyAsnSerAsnAsnGluSerGlu 
IlePheArgProGlyGlyGlyAspHetArgAspAsnTrpArgSerGluLeuTyr 
LysTyrLysValValLysIleGlu?roLeuGIyValAla?roThrLysAlaLys 
ArgArgVaiValGlnArgGluLysArcAlaValGlylleGlyAlaLeuPheLeu 
GlyFheLeuGIy&IaAIaGlySerTh^ 

ValGlnAIaArcGlr^LeuLeuSerGlylleValGlnGlnGIiiAsnAsnL^uLea 
ArgAlalleGluAlaGlnGlnHisLeuL&uGlnLeuThrValTrpGlyXleLys 
GlnLeuGlnAlaArglleLeuAiaValGluArgTyrLeuLysAspGInGlnLeu 
LeuGlylleTrpGlyCysSerGIyLysLeuIleCysTharThrAlaValProTrp 
AsnAlaSerTrpSerAsnLysSerLeuGIuGlnlleTrpAsnAsnMetThrTrp 
MetGIuTrpAspArgGluIleAsnAsnTyrThrSerPbeProGlyAlaArcIle 
LeuGluAspGIuArgAlaSer 
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Table 11A 

Nucleotide sequence encoding fusion protein KH1 



ATGTTACG? 
TACAATGCA 

CCTGTAGAAACCCCAACCCGTGAAATCAAAAAACTCGACGGCCTGTGGGCATTC 
GGACATCTTTGGGGTTGGGCACTTTAGTTTTTTGAGCTGCCGGACACCCGTAAG 

AGTCTGGATCGCGAACGCGAATTCCCTGTGTGGAAGGAAGCAACCACCACTCTA 
TCAGACCTAGCGCTTGCGCTTAAGGGACACACCTTCCTTCGTTGGTGGTGAGAT" 

TTTTGTGCATCAGATGCTAAAGCATATGATACAGAGGTACATAATGTTTGGGCC 
AAAACACGTAGTCTACGATTTCGTATACTATGTCTCCATGTATTACAAACCCGG 

ACACATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTAGTAT.TGGTAAAT 
TGTGTACGGACACATGGGTGTCTGGGGTTGGGTGTTCTTCATCATAACCATTTA 

GTGACAGAAAATTTTAACATGTGGAAAAATGACATGGTAGAACAGATGCATGAG 
CACTGTCTTTTAAAATTGTACACCTTTTTACTGTACCATCTTGTCTACGTACTC 

GATATAATCAGTTTATGGGATCAAAGCCTAAAGCCATGTGTAAAATTAACCCCA 
CTATATTAGTCAAATACCCTAGTTTCGGATTTCGGTACACATTTTAATTGGGGT 

CTCTGTGTTAGTTTAAAGTGCACTGATTTGAAGAATGATACTAATACCAATAGT 
GAGACACAATCAAATTTCACGTGACTAAACTTCTTACTATGATTATGGTTATCA 

AGTAGCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAACTGCTCTTTCAAT 
TCATCGCCCTCTTACTATTACCTCTTTCCTCTCTATTTTTTGACGAGAAAGTTA 

ATCAGCA.CAAGCATAAGAGGTAAGGTGCAGAAAGAATATGCATTTTTTTATAAA 
TAGTCGTGTTCGTATTCTCCATTCCACGTCTTTCTTATACGTAAAAAAATATTT 

CTTGATATAATACCAATAGATAATGATACTACCAGCTATACGTTGACAAGTTGT 
GAACTATATTATGGTTATCTATTACTATGATGGTCGATA.TGCAACTGTTCAACA 

AACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCCTTTGAGCCAATTCCC 
TTGTGGAGTCAGTAATGTGTCCGGACAGGTTTCCATAGGAAACTCGGTTAA GGG 

• ATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACG 
TATGTAATAACACGGGGCCGACCAAAACGCTAAGATTTTACATTATTATTCTGC 

TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAA.TGTACACATGGA 
AAGTTACCTTGTCCTGGTACATGTTTACAGTCGTGTCATGTTACATGTGTACCT 

ATTAGGCCAGTAGTATCAACTC^VACTGCTGTTAAATGGCAGTCTAGCAGAJ^GAA 
T AAT CCGGTCATC AT A GTT G AGTT GACG ACAATTT ACCGTC AGAT C GTCTTCTT 

GAGGTAGTAATTAGATCTGCCAATTTCACAGACAATGCTAAAACCATAATAGTA 
CT C CATC ATT AAT CT AG AC GGTT AAAGTGTCTGTTACG ATTTT GGT ATT AT C AT 
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CAGCTGAACCAATCTGTAGAAATTAATTGTACAAGACCCAACAACAATACAAGA 
GTCGACTTGGTTAGACATCTTTAATTAACATGTTCTGGGTTGTTGTTATGTTCT 

AAAAGTATCCGTATCCAGAGAGGACCAGGGAGAGCATTTGTTACAATAGGAAAA 
TTTTCATAGGCATAGGTCTCTCCTGGTCCCTCTCGTAAACAATGTTATCCTTTT 

ATAGGAAATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATAAC 
TATCCTTTATACTCTGTTCGTGTAACATTGTAATCATCTCGTTTTACCTTATTG 

ACTTTAAAACAGATAGATAGCAAATTAAGAGAACAATTTGGAAATAATAAAACA 
TGAAATT TTGTCT ATCT ATCGTTT AATTCTCTTGTTAAAC CTTTATT ATTTT G T 

ATAATCTTTAAGCAGTCCTCAGGAGGGGACCCAGAAATTGTAAGGCAChGTTTT 
TATTAGAAATTCGTCAGGAGTCCTCCCCTGGGTCTTTAACATTGCGTGTCAAAA 

AATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGTACT 
TTAACACCTCCCCTTAAAAAGATGACATTAAGTTGTGTTGACAAATTATCATGA 

TGGTTTAATAGTACTTGGAGTACTAAAGGGTCAAATAACACTGAAGGAAGTGAC 
ACCAAATTATCATGAACCTCATGATTTCCCAGTTTATTGTGACTTCCTTCACTG 

ACAATCACCCTCCCATGCAGAATAAAACAAATTATAAACATGTGGCAGGAAGTA 
TGTTAGTGGGAGGGTACGTCTTATTTTGTTTAATATTTGTACACCGTCCTTCAT 

GG AAAAG CAA-T GTATGCCCCTCC CAT CAGTGG AC AAATT A GATGTTC AT CAAAT 
CCTTTTCGTTACATACGGGGAGGGTAGTCACCTGTTTAATCTACAAGTAGTTTA 

AT T A CA GG G CT GCT ATT A A CAAG AG AT GGTGGTAATAGCAACAATGAGTCCG AG 
TAATGTCCCGACGATAATTGTTCTCTACCACCATTATCGTTGTTACTCAGGCTC 

ATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATAT 
T AGAAGTCTG G AC CT CCTCCTCT AT ACTC C CT GTTAACCT CTTC ACTT AAT AT A 

AAATATAAAGTAGTAAAAATTGAACCATTA.GGAGTAGCACCCACGAAGGCAAA^ 
TTTATATTTCATCATTTTTAACTTGGTAATCCTCATCGTGGGTGGTTCCGTTTC 

AGAAGAGTGGTGCAGAGAGAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTT 
TCTTCTCACCACGTCTCTCTTTTTTCTCGTCACCCTTATCCTCGAAACAAGGAA 

-GGGTTCTTGGGAGCAGCAGGAAGCACTATGGGCGCAGCGTCAATGACGCTGACG 
CCCAAGAACCCTCGTCGTCCTTCGTGATACCCGCGTCGCAGTTACTGCGACTGC 

GTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAATTTGCTG 
CATGTCCGGTCTGTTAATAACAGACCATATCACGTCGTCGTCTTGTTAAACGAC 

AGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAG 
TCCCGATAACTCCGCGTTGTCGTAGACAACGTTGAGTGTCAGACCCCGTAGTTC 

C A G CT C CA G G CAAG AAT CCT GG CT GT GG AAA GAT A C CT AAA. G GAT C AAC A G CT C 
GTCGAGGTCCGTTCTTAGGACCGACACCTTTCTATGGATTTCCTAGTTGTCGAG 
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CTGGGGATTTGGGGTTGCTCTGGi^AACTCA'rrTGCACCACTGCTGTGCCTTGG 
GACCCCTAAACCCCAACGAGACCTTTTGAGTAAACGTGGTGACGACACGGAACC 

AATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGGAATAACATGACCTGG 
TTACGATCAAGCTCATTATTTAGAGACCTTGTCTAAACCTTATTGTACTGGACC 

ATGG AGT GGG ACAGAGAAATTAACAATTACACA AG CTTCCCG G GAGCTC GAATT 
TACCTCACCCTGTCTCTTTAATTGTTAATGTGTTCGAAGGGCCCTCGAGCTTAA 

CTTGAAGACGAAAGGGCCTCG 
GAACTTCTGCTTTCCCGGAGC 
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Table 12 

Amino acid sequence of HIV portion of protein RIO 

MetV3lTrpL«sGluAl3ThrThpThrLeuPheCysAlaSerAspAlaLysAl3Tyr 
AspTKrGluVelHisAsriValTrpAlBThrHisAIeGhsValPrcThrA^pPro 
AsnProGlriGluValValLeuValAsnValThrGluAsnPheAsriMetTrpLvs 
AsnAspttPtValGIuGIrilletHisGluAspIlGlleSerLeuTrpAspGlriSGr 
LeuLssProCysValLvsLeuTh rP roLeuC*$Vd 1 Se rLeuLysGysTh r Asp 
LeuLysAsnAspThrAsnThrAsriSerSBrSerGlyAr^tlleMetGluLys 
Gl^GluIleLysAsnCysSerPheAsnlleSerThrBerlleAr^GlyL^sVel 
GlnLysGluTy rAlaPhePheT^rLysLeuAspIlel leProIleAspAsnAsp 
ThrThrSerTy rThrLeuTh rSe rCysAsnThrSe rVal IleThrGlnAlaCys 
ProLysVslSerPheGluProIleProI UHxsTyrCysAlsProAl^lyPhe 
Alal leLeuLasCHsAsriAsnLysThrPheAsnGlyThrGlyProCysThrAsn 
VslSerThrValGlr.CysThrHisGlylleAr^ProValUalSerTh rGlnLeu 
LeuLeuAsnGlySe rLeuAl aGluGluGIuValVal 11 eArsSer AlaAsrrPhe 
ThrAspAsnAlsLysThrllel leValGlnLeuAsnGInSerValGluI leAsn 
CysThrAr^FroAsnAsnAsriThrAr^LysSerlleAr^IleGlriAr^GlyPro 
GlyAraAlsPheValThrlleGlyLysIleGlyAsnMetAr^GlnAlaHisCys 
Asnl 1 eSerAr^AlaLysTrpAsriAsnThrLeuLysGlnlleAspSerLysLeu 
AraGluGlnPheGlyAsriAsnL^sThrl lellePheLysGlnSerSerGlyGly 
AspProGluIleVslThrHisSerPheAsnCysGlyGlyGluPhePheTyrCys 
AsnSe rTh rGlnLeuPhe AsnSe TThrTrpPheAsnSerThrTrpSerThrLys 
GlySerAsriAsnThrGluGlySerAspThrlleThrLeuProCysArsIleLys 
GlnllelleAsrittetTrpGlnGluValGlyLysAlaMetTyrAlaProProIle 
SerGlyGlnlleArgCvsSerSerAsnl leThrGlyLeuLeuLeuThrAr^Asp 
GlyGlyAsnSerAsnA snG luSer 
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Table 12A 
Nucleotide sequence encoding 
HIV portion of protein RIO 

A~GG TGTGG A AGGAAGCA ACC ACCACTCTATTTTGTGCATCAG ATGCTAA AGCA T AT 
GAT AC AG AG GT AC AT A A TGI TTGG6CCACACAT6CCTG7GTACCCACAGACCCC 
AACCCACAAGA AGTAGTATTGGTAAATGTGACAGAAA AT T TT A ACATGTGGAAA 
AATGACATGGT AGAAC AGATGCATGAGGATaTAATCAGTTTATGGGATCAAA£C 
CT AAAGCCA TGTGTA AAATTAACCCCACTCTGTGT TAGTTTAA AGTGCaCTGAT 
TTGAAGAATGATACTAATACCAATAGTAGTaGCGGGAGAATGATAATGGAGAAA 

gg ag ag at a aaa aactgctctttcaatatcagc ac a agca t a ag aggtaaggtg 
cagaaagaatatgcatttttttataaacttgatataataccaatagataatgat 
act accagctatacgttg acaagttgtaacacctc agtcatt acacaggcctgt 
ccaa^ggtatcctttgagccaattcccatacattattgtgccccggctggtttt 

GCG ATTCT AAAATGTA AT AAT AAGACGTTCAATGGA ACAGGACCATGTACAAAT 
GTChGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTATCAACTCAACTG 

ctgtt a aatggc agtctggcag aagaagaggtagtaattagatctgccaatt7c 
acagacaatgctaaaaccataatagtacagctgaaccaatctgtagaaattaat 
tgtacaagacccaacaacaatacaagaaaaagtatccgtatccagagaggacca 
gggagagcatttgttacaataggaaaaataggaaatatgagacaagcacattgt 
aacattagtagagcaaaatggaataacactttaaaacagatagatagcaaatta 
agagaacaatttggaaataataaaacaataatctttaagcagtcctcaggaggg 
gacccagaaattgtaacgcacagttttaattgtggaggggaatttttctactgt 
aattcaacacaactgtttaatagtacttggtttaatagtacttggagtactaaa 
gggtcaaataacactgaaggaagtgacacaatcaccctcccatgcagaataaaa 
caaattataaacatgtggcaggaagtaggaaaagcaatgtatgcccctcccatc 
agtggacaaattagatgttcatcaaatattacagggctgctattaacaagagat 
ggtggtaatagcaacaatgagtcc 
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Table 13 

Amino acid sequence of HIV portion of protein PB1 

Met LeuftsriGlriSerValGluIleAsnC^sThrAr^ProAsnAsnAsnThrArsLtfs 
SerlleAr^IleGlnAr^GlyProGlyAr^AlaF-heUalThrlleGl^Lysrie 
GlyAsrirtetArsGlriAlsHxsCysAsnlieSerArsAlsL^sTrpAsnA^nT'n r 
UeuLysGlnl X©AsPSerL«sLeaAr5GluGlnPheG2yAsrfAsnt^sThrI le 
1 1 ePheL *%G InSe rSerG 2 yGlyAspP r oGluIl eV a ITh r Hi sSe rPhe Asn 
CysGl yGl «61uPhePheTy rC*sAsnSe rTh rGlnLeuPheAsnSe rThrT rp 
Phe A snSerThrTrf-Se rTh rLy sG lySe r AsnAsriThrG luG l^Se rAspTh r 
II sThrLegP roCysArsI leLysGlnl lei leAsnMe tT rpGlr.G luVslGly 
L^sAlaHetT^rAlaProProIleSerGlyGlnlleArsCysSerSe rAsnl 1 e 
ThrGlyLeuUeuLeuThrA rsAspGlvGl^AsnSe rAsnAsnGluSe r 
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Table 13A 
Nucleotide sequence encoding 
HIV portion of protein, PB1 

ATGCTGAACCAATCTGTAGAAATTAATTGTACAAGACCCAACAACAATACAAGAAAA 
AGT ATCCGT ATCC AGAGAGGACCAGGGAGAGCATTTG7T AC AATAG6AAA AAT A 
GGAAATATGAGACA AGCACATTGTAACATTAGTAGAGCA AAATGGAATAACACT 
TThAAAC AGAT AGATAGCAAATTAAGAGAACAAT7TGGAAATAATAAAACAATA 
ATCTTTAAGCAGTCCTCAGG AGGGGACCCAG AAATTGTA ACGCACAGTTTTAAT 
TGTGGA6GGGAAT7TTTCTAGTGTAATTCA ACACAACTGTTTA ATAGTACTTGG 
TTTAATAGT ACTTGGAGTACTAAAGGGTCAAATAACACTGAAGGAAGTGACAC A 
ATCACCCTCCC ATGC A6AATAAA ACAA ATTATAAACAT6TGGCAGGAAGT AGG A 
AAAGCAATGTAT6CCCCTCCCATCAGTGGACAAATTAG ATGTTCATCAAATATT 
ACAGGGCTGCTATTAAGAAG AGAT6GTGGTAATAGCAACAATGAGTCC 
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Table 14 
Amino acid sequence of 
HIV protion of protein 590 

Men LeuAsnGlnSe rVa IGlal 1 eAsnCysThrA r^ProAsnAsnAsnThrA r^L^s 
SerlleAral leG lr, A raG lyp r oGlyA r sA 1 aPhe Va ITh r 1 1 eG 1 yLy s I le 
GlyA^nMetAr^GlriAlaHisCy^Asrill^SerArsAl eLy^T pf A^nAsrJhr 
LeuLysGlnl leAspSerLysLeuArSGluGlnPheGlyAsnAsnLysThrl 1 e 
IlePheLysGlnSe rSe rGlyGlyAspProGluI 1 eValTh rHisSe rPhe Asn 
CysGlyGlyGluPhePheTy rCysAsnSerThrGlnLeuPheAsnSerThrTrp 
PheAsnSe rTh rT rp Se rTh rLasG lySe r AsnAsnTh rG luG l^Se rAsp-Th.r 
1 leThrLeuProCysAralleLysGlnl lelleAsnMetTrpGlnGluUalGly 
LysAlaMetTy r AlaProProIleSerGlyGlnlleAr^CvsSerSerAsril le 
Th rGLvLeuLeuLeuThrA rsAsF-GlxGlxAsnSerAsnAsnGluSerGluIl e 
PheArSP roGlyGlyGlyAspMetAr^AspAsnTrpArsSe rGluLeuTy rLy s 
Ty rLvs'Ual ValLysI leG luP roLeuGIy ValAlsProThrLysAl aLysA r 2 
Ar^yalVBlGlriAr^GluLysArsAlaVelGlylleGlyAlaLeuPheLeuGly 
PheLeuGlyAlaAlsGlvSerThrMetGlyAlaAlaSerMetThrLeuThrVal 
Gi nAl aAr^Gl nL euLe uS e rG 1 yl 1 eVa 1 Gl nG lnG In Ash A snLeuLeuAr 3 
Ala I leGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlylleLysGIn 
Lg.uG 1 nAlaArglleLeuAlaValGl uA r 3Ty rLeuLysAspG 1 nG 1 nLeuLeu 
Glyl leTrpGlyCysSerGlyLysLeuIleCysThrThrAlaValProTrpAsn 
AlaSerTrpSerAsnLysSerLeuGluGlnl leT rp AsnAsnMe tTh rT rpMet, 
Gl uTrpAspArSGluI leAsnAsnTyrThr 
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Table 14A 
Nucleotide sequence encoding 
HIV portion of protein 590 

ATGCTGAACCAATCTGTAGAAATTAATTGTACAAGACCCAACAACAATACAAGAAAA 
AGTATCCGTATCCAGAGAGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATA 
GG A AATATGAG AC A AGCACATTGT A ACATTAGTAGAGCAAAATGGA ATAACACT 
TT A AA AC AG AT AG AT A6CAAATTAAGAG AAC AATTTGGA AAT A ATAA AACAATA 
ATCTTTAAGCAGTCCTCAGGAGGGGACCCAGAAATTGT AACGCACAGTTTT AAT 
TGTGGAGGGGAATTTTTCTACT6TAATTCAACACAACTGTTTAATAGTACTTGG 
TTTA AT AGT ACTTGG AGTACT A AAGGGTCAAATAACAGTGAAGGAAGTG ACACA 
ATCACCCTCCCATGCAGAATAAAACAAATT ATAAAC ATGTGGCAGGA A6TAGG A 
AAAGCAATGTATGCCCCTCCCATCAGTGGACAAATTAGATGTTCATCAAATATT 
AC AGGGCTGCT ATT AAC A AG AG A TGG TGGT AAT AGC A ACA A TGAGTCCG A GAT C 
TTCAGACCTGG AGGAGGAGATATG AGGG ACAATTGGAGA AGTGAATTATATAA A 
TAT A AAGTAGT AA AAATTGAACCATTAGGAGTA6CACCCACCAAGGCAA AG AG A 
AGAGTGGTGCAG AGAGAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGG 
TTCTTGGGAGCAGCA6GAAGCACTATGGGCGCAGCGTCAATGACGCTGACGGTA 
CAGGCCAGACAATT ATTGTCTGGT ATAGTGCAGCAGCAG AACA ATTTGCTGAGG 
GCT ATTGAGGCGC AACAGC ATCTGTTGC A ACTC ACAGTCTGGGGCATCA AGC AG 
CTCCAGGCAAGAATCCTG.GCTGTGGAAAGAT ACCTAAAGGATCAACAGCTCCTG 
GGGATTTGGGGTTGCTCTGGAAAACTCATTTGCACCACTGCTGTGCCTTGGAAT 
GCTAGTTGGAGTA AT AAATCTCTGGAACAGATTTGGA ATA ACA TGACCTGGATG 
GAGTGGGAC AG AG AAATTAACAATTAC ACA 
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Table 15 
Amino acid sequence of 
HIV portion of protein KH1 
Met ^alTrpLysGluAlsThrThrThrLeuPheCysAlaSerAspAlsLysAlaTy r 

AspThrGltjV3lHisAsnV3lTrpAl3ThrHisAl3CysV3lProThrftspPro 

AsnFToGlnGluVslUslLeuValAsnValThrGluAsnPheAsnHetTrpLys 

AsnAspHetValGluGlnMetUisGluAspIle 1 1 eSe rLeuT tpAspGI r,S e r 

LeuLysProCys^slLysLeuThrProLeuCysUslSerLeuLysCysThrAsP 

LeuLysAsnAspThr AsnThrAsnSe rSerSe rG 1 yA r3fte 1 1 1 eHe tC I uLys 

GlyGluIleLysAsnCysSerPheAsnlleSerThrSerlleAraGlyLysVBl 

GlnLysGluTy rAlsPhePheTy rLysLeuAsp! lelleProI leAspAsnAsp 

ThrThrSe rTy rTh rL«uTh rSe rCys AsnTh rSe rVal 1 1 eThrG In Al aCys 

ProLysVslSerPheGluProIleProIleHisTy rCysAl sP roAl aGlyPhe 

AlalleLeuLysCysAsnAsnLysThrPheAsnGlyThrGlyProCysThrAsn 

y3l3arThrV3lGlriCysThrHi5GlyIleAP2ProU3lValSerThrGlriLeu 

LeuLeuAsnGly Sq rLeuA J aGluGluGluVal Val I leArdSe rA laAsnPhe 

ThrAspAsnAlaLwsThrllell eValGlaLeuAsnGlnSerValGluZ leAsn 

CygThrArsP ro AsnAsnAsnTh r A rsLysSe rIleAr*IleGlnA r^GlyP ro 

GlyArsAl sPheValThrl leGlyLys! leGlyAsnMetAr^GlnAlaHi sCys 

Asnl leSerA r^AlaLysTrpAsnAsriThrLetjLysGInlleAspSerLysLeu 

ArsGlsjGlnPheGlyAsnAsnLysThrllellePheLysGlriSerSerGiyGIy 

AspProGluI leVslThrHisSerPheAsnCysGlyGlyGluPhePheTy rCys 

AsnSerThrGlnLeuPheAsnSerThrTrpPheAsnSerThrTrpSerThrLy s 

GlySe rAsnAsnThrGluGlySe rAspThrl leThrLeuProCysAr^I leLys 

GlnllelleAsriMetTrpGlnGluValGlyLysAlaHetTyrAIaProProIl© 

SerGlyGlnIleAr^CysSerSerAsnIleThpGIyLeuLeuLe»jThrAr5Asp 

GlyGlyAsnSerAsriAsriGl^JoerGluI lePheArSProGlyGlyGlyAs^Met 

A r£A s p As nl rp A r^Ss rG I uLeuTs rLy sly rLy sValUalLy 5 1 leGLuF ro 
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Table 15 (cont.) 

LeuGlyVslAlsProThrLysAlsLysAp^ArsValValGlnArsGluLasAra 
AlayalGlylleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySe rThr 
MGiGlyAlaAlsSerttetThrLeuThrUslGlnAlaArSGlriLeuLeuSerGl^ 
1 1 eVslGlnGlriGlriAsriAsriLeuLeuAr^AlalleGluAlaGlnGlnHisLeu 
LeuGlnLeuTh rV.alTrpGlvI 1 eLysGlnLeuGlnAl ©Aral leLeuAlsVs 1 
GluArsTy rLeuL^sAspGlriGlnLeuLeuGly IleTr^GlyCysSerGl^Lys 
LeuIleCysThrThrAl2yalProTrJ*AsriAleSerTrpSerAsnLysSerLeu 
GluGlnlleTrpAsnAsriMeiThrTrphetGluTrpAspAr^GluIleAsnAsn 
TyrThr 
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Table ISA 
Nucleotide sequence encoding 
HIV portion of protein KH1 

ATGGTGTGGA AGG AAGC AACCACCACTC7A7777G7GC A7CAGA7GC7AAAGCA7A7 
G AT AC AGAGGT AC A7AA7G 77 766GCC AC AC A 7GCC7G7GTACCCAC AG ACCCC 
AACCCACAAGAAG7AG7ATTGGTAAATGTGACAGAAAAT7T7AACA7G7GGAAA 
AATG ACA7GGT AG AACAGA7GCA7GAGGATA7AATCAG777A7GGGA7C AA AGC 
CT AAAGCCATGTGT A AA ATTAACCCCACTCTGTGTTAGTTTA AAGT6CACTG AT 
77GAAGAA7GA7AC7AATACCAA7AG7AG7AGCGGGAGAATGA7AA7GGAGAAA 
GG AG AGA7 AAAAAAC7GC7C777CAA7A7CAGCACA AGCA7A AGAGGT AAGG7G 
CAG AAAGAA7A7GCA7777777A7AAAC77GA7A7AA7ACC AA7 AGA7 AA T6AT 
ACT ACCAGCT ATACGTTGACAAGTTGTAACACCTCAGTCATTACAC AGGCCTGT 
CCA A AGG7 A7CC777GAGCCAA7TCCCA7ACA77 A77G7GCCCCGGC7GG777 7 
GCGATTCTAAA ATGT AATAATA AGACG7TCAATGG AACAGGACCA7GTACAAA T 
G7CAGCACAG7ACAA7G7ACACATGGAA7TAGSCCAGT AG7 ATCAACTCAACTG 
CTGTTAAATGGCAGTCTGGCAGAAGA AGAGGT A5TAATTAGATCTGCCAATTTC 
ACAGACAATGCTAAAACCATAATAGTACAGCTGAACCAATCTGTAGAAATTAAT 
TGTACAAGACCCAACAACA AT AC A AG AAA A AGT A TCC6T A TCC AG AG A GGACC A 
GGGAGAGCATTTGTTACAATAGGAAAAATAGGAAATATGAGACAAGCACATTGT 
AACATTAGT AG AGCA AAATGGAATAACACTTTAAAACA6ATAGAT AGCAAATT A 
AG AG AAC A ATTTGG AAAT A AT AA AACAATAATCTTT AAGCAGTCCTCAGG AGGG 
GACCCAGAAATT6TAACGCACAGTTTTAATTGTGGA6GGGAATTTTTCTACTGT 
AATTC AACACAACTGTTT AATAG7ACTTGGTTTAATAGT AC7TGGAGTACTAAA 
GGG7C A A AT AAC ACTG A AGGAAGT GAG AC A ATCACCC7CCCA7G CAG A A7 AAA A 
CAAATTATAAACA7GTGGCAGGAAG7AGGAAAAGCAA7G7A7GCCCC7CCCA7C 
AGTGGACAA AT7AG A7GT7CA7CAA ATA77 ACAGGGC7GC7A77AACAAGAGA7 
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Table 15A (cant.). 

GGTGGT AAT A GC A AC A ATG AG TCCGAGATCTTC AG ACCT GGAGG A GG AG AT ATG 
AGGGACAATTGGAGAAGTGAATT ATATAAATATAAAGTAGTAAAAATTGAACCA 
TTAGGAGTAGCACCCACCAAGGCAAAGAGAAGA13TGGTGCAGAGAGAAAAAAGA 
GC AGTGGGAAT AGGAGCTTTGTTCCTTGGGTTCT7GGGAGCAGCAGGAAGCACT 
ATGGGCGCAGCGTCAATGACGCTGACGGTACAGGCCA6ACAATTATTGTCTGGT 
ATAGTGCAGCA6CAGAACAATTTGCTGAGGGCTAT7GAGGCGCAACAGCATCTG 
TTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAGGCAAGAATCCTGGCTG7G 
GAAAGAT ACCT AA AGGATCAACAGCTCCTGGGG ATTTGGGGTTGCTCTGG AAAA 
CTCATTTGC ACCACTGCTGTGCCTTG6AATGCTAGTTGGAGTAATAAATCTCTG 
GAhCAGATTTGGAATAACATGACCTGGATGGAGTGGGACAGAGAAATTAACAAT 
TACACA 
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The claims defining the invention are as follows: 

1, A process, for stimulating a lymphocyte proliferative response i 
humans which comprises treating humans in need of stimulation of a 
lymphocyte proliferative response with a recombinant HIV portion of an 
HTLV-III protein selected from the group consisting of RIO, PB1 > 590 and 
KM, wherein members of said group are as hereinbefore described. 

2- A process, according to claim 1, wherein said recombinant HIV 
portion is the HTLV-III protein portion of RIO. 

3. A process, according to claim 1, wherein said recombinant HIV 
portion is the HTLV-III protein portion of PB1 . 

4. A process, according to claim 1, wherein said recombinant HIV 
portion Is the HTLV-III protein portion of 590- 

5- A process, according to claim 1, wherein said recombinant HIV 
portion is the HTLV-III protein portion of fCHK 

DATED this SEVENTH day of MARCH 1991 

Repligen Corporation 



Patent Attorneys for the Applicant 
SPRUSON & FERGUSON 



3440 bp 



sex* 



Ndel 



pBGl 
(5600 bp) 



Figure 1. Construction of expression 
vector pREV2J2 



2445 bp 




1. Nd el 

2. Ligation 



Ndel 



EcoRI Bd 
AatU ^ x 



Bell 




1. Bci-EcoRL 

2. Isolate 2445 bp fragment 



3. Location 



AaUl 



A 

Kpni 



AaU! 
AatU I EcoRI 
Xrnnl X L ^ 



i. Aatu 



Kpnl 



SG 



2. LIcate 



EcoRi 



Synthetic 
multiple 
cloning site 
fragment 

(Table 1} 



AatU 



Synthetic 
fragment 
containing 
TrpA 

transcription 
terminator 

(Table II} 



AalJI 



> • • 



• a » 

• mm 

> • • 

• CO 



• • • 

• o e 



o • « 



Xnnl 




Bell 



Figure 1. continued 
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Figure 2. Schematic of pREV2.2 and of Multiple Cloning Site 
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Figure 3. Schematic of HTLV-III envelope gene and recombinant proteins 
obtained therefrom. 
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